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PATENT APPLICATION 

IN THE UNITED STATES PATENT 
AND TRADEMARK OFFICE 

Applicants: 
Grreg Firth 

U.S. National Phase (35 USC 371) 
Application Based on PCT/GB98/00840 
filed 20 March 1998 

Serial No.: To Be Determined 

Filed: Herewith 

For: Extraction and Utilisation of VNTR 
Alleles 

Group Art Unit: To Be Determined 
Examiner: To Be Determined 

PRELIMINARY AMENDMENT ACCOMPANYING FILING 
OF NATIONAL STAGE APPIJCATTON UNDER 35 U.S C 371 

BoxPCT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

AMENDMENT 

Please amend claims 3, 4, 5, 6, 9, 13, 15, 19, 24, 26 and 27 as follows: 

In the Claims : 

-3 . [AMENDED] The method of claim 1 [or claim 2], wherein in step c) the 
VNTR repeat sequences are removed jfrom the 5'-flanking VNTR amplimers, and in step d) the 
VNTR repeat sequences are removed from the 3 '-flanking VNTR amplimers. ~ 



"EXPRESS MAIL" mailing label 
No. EM362734296US 

Date of Deposit: September 10, 1999 

I hereby certify that this paper (or fee) is 
being deposited with the United States 
Postal Service "EXPRESS MAIL POST 
OFFICE TO ADDRESSEE" service under 
37 C.F.R. §1.10 on the date indicated above 
and is addressed to the Assistant 
Commissioner for Patents, Washington, 
D.C. 20231. 




Richard Zimmermann 



~4. [AMENDED] The method of claim 1 [any one of claims 1 to 3], wherein 
in step c) and/or d) the adaptor or primer used contains at least one phosphorothioate bond.-- 

—5, [AMENDED] The method of claim 1 [any one of claims 1 to 4], wherein 
step e) is performed using as primers, either successively or together, both the mixture of 5'- 
flanking VNTR amplimers and the mixture of 3'-flanking VNTR amplimers.-- 

—6. [AMENDED] The method of claim 1 [any one of claims 1 to 5], wherein 
there is used in step e) genomic DNA of one or more members of the species of interest which 
manifest a trait of interest, whereby the resulting mixture of VNTR alleles and their flanking 
sequences is representative of those which manifest the trait of interest.— 

~9. [AMENDED] The method of claim 6 [any one of claims 6 to 8], wherein 
at least one VNTR allele and its flanking sequences representative of those which manifest the 
trait of interest, is hybridised with a mixture of VNTR alleles and their flanking sequences 
representative of those which do not manifest the trait of interest, and at least one match and/or 
at least one mis-match is selected to provide at least one VNTR allele or fragment thereof which 
is characteristic of the trait of interest. ~ 

~13, [AMENDED] The portion as claimed in claim 1 1 [or claim 12], wherein 
each member of the mixture has an adaptor at each of its 3'-end and its 5 -end.- 



—15. A portion of genomic DNA of a species of interest, said portion consisting 
essentially of a representative mixture of 3 '-flanking regions of a chosen VNTR sequence, each 
member of the mixture carrying an adaptor at its 3*-end, and a representative mixture of 5- 
flanking regions of a chosen VNTR sequence, each member of the mixture carrying the same [an] 
adaptor at its 5'-end,— 

-19. [AMENDED] The method of claim 16 [any one of claims 16 to 18], 
wherein at least one VNTR allele and its flanking sequence representative of those which manifest 
the trait of interest, is hybridised with a mixture of VNTR alleles and their flanking sequences 
representative of those which do not manifest the trait of interest, and at least one match and/or 
at least one mis-match is selected to provide at least one VNTR allele or fragment thereof which 
is characteristic of the trait of interest, ~ 

-24. [AMENDED] The method of claim 22 [or claim 23], wherein the at least 
one allele and its flanking sequences representative of those which manifest the trait of interest, 
is provided with 3 -overlapping ends.-- 

-26. [AMENDED] The method of claim 1 or claim 16 [claims 1 to 10 or 16 to 
20], wherein the VNTR allele and its flanking regions, or the mixture of VNTR alleles and their 
flanking regions, is analysed by being applied under hybridsation conditions to an array of 
immobilised VNTR alleles and/or their flanking regions. - 

—27, [AMENDED] A kit comprising protocols and reagents for performing the 
method of claim 1 or claim 16 or claim 24 [any one of claims 1 to 10, 16 to 24 or 26]. 



REMARKS 

The foregoing amendments are made to eliminate various multiple dependencies 

and place the claims in condition for allowance. An early notice of allowance is hereby solicited. 

Should the Examiner wish to discuss any matter of form or substance, he or she is invited to 

contact the undersigned attorney at the number listed below. 

Respectfully submitted, 

MARSHALL, OTOOLE, GERSTEIN, 
MURRAY & BORUN 



By 




by S. Sharp, Rfeg. No. 31,879 
6300 Sears Tower 
233 South Wacker Drive 
Chicago, Illinois 60606-6402 
(312) 474-6300 

Chicago, Illinois 
September 10, 1999 
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EXTRACTION AND UTIMSATION OF VNTR ALLELES 



Glossary of Terms and Abbreviations 



adapter 



15 



25 



AFLP 
allele 

amplimer 



DNA 

DNA fingerprint 

20 GMS 

individual 
heterodupiex 



heterozygygous 



homozygous 

30 locus 

mis-match 



nucleotide sequences, usually comprising annealed 

complementary oligonucleotides, ligated to DNA 

fragments that allow specific amplification and 

manipulation of those fragments 

amplified fragment length polymorphism 

one of several possible alternative sequence variations 

at any one locus 

the product, or pool of products, generated by 
amplification with the adapter primer and an 'internal 
primer' 

deoxyribonucleic acid 

the display of a set of DNA fragments from a specific 
DNA sample 

genomic mis-match scanning 

a member of any species subject to investigation 

a duplex of two alleles derived from different individuals, 

sets of individuals or populations 

alleles at the same locus of each of the paired 

chromosomes in a diploid ceil being different 

homoduplex a duplex of alleles derived from the same 

individual, set of individuals or population 

alleles at the same locus of the paired chromosomes of 

a diploid eel! being identical 

a specific position on a chromosome 

one or more bases in a duplex that fail to form stable 
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NASBA 
PGR 
RAPD 
5 RDA 
RFLP 
trait 

TRAIT 

10 

VNTR 



15 

Field of the invention 

The field of this invention is the detection of polymorphic 
variation in complex genomes, which is the mainstay of the study of 
20 hereditary traits in ail organisms. Since polygenic traits far outweigh those 
that are monogenic, a procedure that allows the isolation in concert of 
several informative polymorphisms within the complex genomes of multiple 
individuals would provide an extremely powerful tool for the investigation of 
hereditary traits. 

25 The invention differs fundamentally from ail other techniques 

that have been previously employed by: 

(0 permitting mass generation of VNTRs quickly and easily from 

DNA 

(ii) generating polymorphisms that are both linked and 
30 informative for a trait; 

(iii) reproducing and preserving the polymorphic allele, as it 



hydrogen bonds with opposing bases 
nucleic acid sequence based amplification 
polymerase chain reaction 
random-amplified DNA markers 
representational difference analysis 
restriction fragment length polymorphism 
a distinguishing feature or characteristic manifesting 
itself physically, chemically or biologically 
Total Representation of Alleles that are Informative for a 
Ira it 

variable number tandem repeat, also referred to as 
simple sequence repeats (encompassing all repeats of 
two or more nucleotides that may be continuous or 
interrupted by short non-repetitive sequence, including 
minisatellites and microsateilites). 
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occurs in the genome; 

(iv) negating problems that are features of other polymerase 

chain reaction based techniques; including miss priming, reaction 
contamination and generation of spurious products; 
5 (v) negating the need for investigations to be confined to families 

of closely-related individuals; 

(vi) permitting the analysis of polygenic traits; 

(vii) having a sparing requirement for DNA starting material. 
The invention therefore represents a major advancement in 

10 the ability of workers in the biomedical fields to screen simple or complex 
genomes, rapidly and with fidelity, for polymorphisms co-segregating with 
advantageous or deleterious monogenic or polygenic hereditary traits. 
There is enormous potential for advancement of medicine, veterinary 
medicine, forensic science, agriculture, animal husbandry and 

15 biotechnology, by the generation of polymorphic markers co-segregating 
with hereditary disease or traits of social or economic importance. The 
invention will also serve to facilitate mutation analysis for all relevant 
organisms. 

20 Introduction 

DNA is a double stranded linear polymer composed of 
repetitions of four mononucleotide units. The sequence in which these 
units are arranged gives rise to a genetic code, referred to as the genome. 
Although the genomes of ail individuals within a species are essentially 

25 homologous, subtle variations exist which impart individuality. Locations of 
the genome at which more than one sequence variation may exist are 
termed polymorphisms, each variant of that sequence representing an 
allele. Polymorphisms in gamete-forming germinal cells will be inherited by 
subsequent generations of progeny. By studying the combination of 

30 polymorphisms in the genome of an individual a unique code (fingerprinf ) 
can be assigned and the ancestry of that individual can be determined. 
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Furthermore, a polymorphism found to be linked and co-segregating with a 
particular genetic trait or hereditary disease may be used as a marker for 
genetic screening of that trait or disease in other individuals. 

The study of advantageous or deleterious hereditary traits in 

5 complex genomes has been the subject of considerable interest due to its 
economical, medical and social implications. The establishment of 
protocols that allow the comparison of nucleic acid sequences in complex 
genomes and the isolation of differences unique to a subset of those 
sequences is a fundamental requirement of this field of study. 

io A number of protocols have been used in animals and plants 

for the comparison of nucleic acid sequences and isolation of differences 
between those sequences in individuals. These protocols involve 
restriction fragment length polymorphism (RFLP), random-amplified 
polymorphic DNA markers (RAPD), amplified fragment length 

15 polymorphism (AFLP), representational difference analysis (RDA). genomic 
mis-match scanning (GMS), and linkage analysis of variable number 
tandem repeats (VNTR). These protocols detect polymorphisms by 
assaying subsets of the total DNA sequence variation in a genome. 
Polymorphisms detected by RFLP, AFLP, and RDA rely on the generation 

20 of a fingerprint ladder by gel-electrophoresis which reflects restriction 
fragment size variation. RAPD polymorphisms result from sequence 
variation at primer binding sites and differences in length between primer 
binding sites. GMS polymorphisms result from sequence variation within 
heterohybrid molecules comprising restriction fragments derived from two 

25 related individuals. Linkage analysis involves the detection of length 

variation of variable number tandem repeats (VNTRs) and co-segregation 
of one allele with a trait of interest. 

RFLP 

30 RFLP analysis relies on the cleavage of a nucleic acid 

sequence by restriction endonucleases and separation of the resulting 
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fragments by gel electrophoresis. The fragments are blotted onto a 
membrane and hybridized to labelled probes to ailow detection of fragment 
length variation. This technique may be of use in the study of a single 
isolated locus or gene fragment, but where an investigation is not confined 
5 to an isolated sequence it is inadequate. Further limitations are that only a 
small number of the polymorphisms generated may be informative, there is 
a high demand for DNA starting material, and the method is labour 
intensive. 

10 RAPD 

RAPD is a commonly used PCR-based polymorphic marker 
technique in genomic fingerprinting and diversity studies, particularly for 
plant species. This technique involves the use of a single 'arbitrary primer' 
which gives rise to amplification of regions of genome where there is 

15 sufficient homology between the sequences of genomic DNA, in the 5' to 3' 
direction, and that of the arbitrary primer. The amplified products are 
separated by gel electrophoresis. Subtle variations of this method include 
arbitrary primed-PCR (AP-PCR) and DNA amplification fingerprinting 
(DAF). However, the principle of arbitrary priming and amplification of DNA 

20 by PGR for difference analysis is common to all. Advantages compared to 
RFLP are that these methods are more rapid, have a lower demand for 
DNA, and do not require prior knowledge of sequence. A limitation in 
common with RFLP is that each analysis can only compare the genomes of 
two individuals. Although several loci can be evaluated concomitantly by 

25 this method, detection of polymorphisms requires observation of variation 
in band patterns by gel-electrophoresis and is subject to errors of 
superimposition of different alleles of similar electrophoretic mobility. Many 
bands may be faint and difficult to interpret, and it is difficult to achieve 
consistent results in repeat experiments. In common with the majority of 

30 PGR techniques, the results are prone to error by subtle changes in 
reaction conditions, reagent contamination, and the generation of 
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inconsistent banding patterns. This lack of reliability limits the usefulness 
of such techniques in the 'typing' of individuals. 

AFLP 

5 AFLP analysis (EP, A, 0534858; Zabeau M a/.) involves 

restriction endonuciease digestion of DNA and ligation of the generated 
restriction fragments to adapters. Using primers complementary to the 
adapter sequence, the restriction fragments are amplified by PGR, and the 
products are separated by gel-electrophoresis, differences in band patterns 

10 revealing polymorphisms, Microsateliite-AFLP (WO 96/22388; Kuiper M et 
a/.) is a modification of this technique in which two or more restriction 
enzymes, at least one of which cuts at a simple sequence repeat, are used 
to cleave DNA into fragments that are ligated to adapters. The fragments 
are amplified with primers complementary to the adapter sequence. In 

15 common with RAPD, several loci can be evaluated concomitantly by this 
method, but detection of polymorphisms requires observation of variation in 
band patterns by gel-electrophoresis and is subject to errors of 
superimposition of different alleles of similar electrophoretic mobility. The 
ability to score bands on an AFLP fingerprint is compromised by generation 

20 of large numbers of bands of which some may be very faint and difficult to 
interpret. Furthermore, the technique is prone to errors that are common to 
all PGR based techniques, summarised above, and suffers from an inability 
to analyse multiple complex genomes simultaneously. This is compounded 
by the generation of bands, by incomplete restriction of the template DNA, 

25 that do not reflect true polymorphisms. AFLP and RAPD analyses 

therefore share many of the same limitations. An additional problem is that 
AFLPs, rather than being evenly dispersed through out the genome, are 
reported to be clustered around centromeres. Gonsequently, this method 
may not allow the generation of polymorphisms that co-segregate with 

30 sequence differences of interest if they are located at a distance from 
centromeres. This problem is reflected in the reduced rate of 
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polymorphism detection compared to techniques such as linkage analysis. 
Furthermore, the complexity of the experimental data derived by AFLP 
becomes exaggerated with increasing complexity of the genome subject to 
analysis. Consequently, although it has been possible to investigate the 
5 genomes of some plant species by AFLP analysis, the relatively complex 
genomes of higher eukaryotic species may be beyond the useful capacity 
of this technique. 

RDA 

RDA involves restriction endonuclease digestion of DNA, 
ligation of the fragments to adapters and amplification by PGR. Differences 
between compared genomes are selected by successive rounds of 
subtractive hybridization and kinetic enrichment such that regions of 
difference predominate. This technique is prone to erroneous results 
through reaction contamination and generation of spurious products. In 
addition, a fundamental requirement of RDA is the availability of families of 
closely related individuals, some of which are manifesting the trait of 
interest. Where RDA is performed on anything other than closely related or 
highly inbred genomes the multiplicity of differences is too vast for succinct 
and useful analysis. 

GWIS 

GMS is technique for mapping regions of identity-by-descent 
of two related individuals. The entire genome is compared in a single 
25 hybridisation that has a high demand for DNA since the genomic samples 
are not amplified. Freedom from the need of prior map information, 
conventional markers, or gel electrophoresis are to its advantage. 
However, the method is restricted to use on the genomes of only two 
related individuals. 

30 Restriction fragments of the two genomes are hybridised, one 

of which having been methylated such that heterohybrid molecules can be 



15 
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distinguished through their resistance to digestion by Dpn I and Mbo 1 that 
cleave only fully methylated and unmethylated molecules, respectively. 
Heterohybrids containing homologous strands that lack mis-matches are 
selected and used to probe an array of mapped clones. Although the mis- 

5 match proteins used in this technique may resolve point mutations 

polymorphisms comprising more substantial mis-matches that are beyond 
the limit of this system are not detected. Therefore, in keeping with RFLP, 
AFLP, RAPD, and RDA, GMS tends to resolve binary polymorphisms that 
may have low informative power. 

10 In ail of the above techniques it is essential that there is a 

difference in nucleotide sequence at or between primer binding sites or 
endonuclease restriction sites in order to detect polymorphisms. This 
highlights the major limitations of these procedures, because in many 
instances a mutation giving rise to a hereditary trait will not create a 

15 sequence difference detectable by variation in primer binding or restriction 
enzyme digestion. Consequently, a polymorphism linked to a trait of 
interest will not be identified using these techniques. GMS detects 
polymorphisms that are incidental to the restriction site and is spared some 
of the limitations of the other methods. However, in contrast to VNTR 

20 polymorphisms, the majority of polymorphisms detected by all of these 
techniques are not informative. 

Linkage analysis 

Linkage analysis is an indirect molecular genetic strategy that 
25 involves the systematic comparison of the inheritance of polymorphic 
VNTRs with the trait of interest in families in which that trait is present. 
There are a number of types of VNTR, including minisateiiites and 
microsateilites, a feature of al! being the repetition of elements of simple 
sequences. They are polymorphic by virtue of variation in the number of 
30 times each element is repeated, giving rise to alleles with variation in 
length. Since several alternative alleles may exist at any one locus, in 
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contrast to polymorphisms based on variation in primer binding or 
restriction enzyme digestion, VNTR polymorphic alleles tend to be highly 
infonnative. Consequently, where co-segregation of a trait with a particular 
VNTR allele is demonstrated, the allele may be used as a marker for that 
5 trait, or may be used as a vehicle to facilitate identification of the molecular 
genetic basis of the trait. Microsateilites are ubiquitously distributed 
throughout all eukaryotic genomes. Consequently, linkage analysis with 
microsateilites is associated with the highest polymorphism detection rate 
of the genetic screening methods. Indeed, systematic microsatellite 

10 analyses have already been responsible for many advances in the 
understanding of certain types of common cancer. Linkage analysis 
therefore has advantages compared to other related methods of difference 
analysis, the results of which are very reproducible. However, linkage 
analysis is very time consuming, labour intensive and expensive. 

15 Furthermore, since many analyses are performed individually the overall 
requirement for DNA is extremely high. This is particularly true if a physical 
map of the genome is unavailable for the selection of informative 
microsateilites that are evenly distributed throughout the genome. The 
demonstration of linkage requires the application of elaborate statistical 

20 programs and powerful computer software for analysis of the experimental 
data. This technique is better suited to monogenic defects since the 
statistical analyses required for multigenic traits are particularly complex. 
Unfortunately, multifactorial genetic traits are far more prevalent than 
monogenic defects, making linkage analysis a cumbersome technique for 

25 the investigation of the majority of hereditary traits. 

The characteristics of an ideal protocol for isolation of 
polymorphisms co-segregating with disease in complex genomes would 
include: 

(') the ability to isolate simultaneously and with fidelity the 

30 polymorphisms from complex genomes of several individuals 

('0 the ability to isolate several polymorphisms simultaneously. 
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permitting the analysis of polygenic traits 

(iii) a high detection rate of polymorphisms that co-segregate with 

sequence differences in alt eukaryotic species, including subtle differences 
such as those resulting from point mutations 
5 (iv) no requirement for large families of closely related individuals 

to study traits of interest 

(v) no requirement for physical maps of the genome or prior 
knowledge of genomic sequence 

(vi) a requirement for sparing quantities of nucleic acid samples 
10 for analysis 

(vii) simplicity of use without a need for expensive specialist 
laboratory equipment or computer software 

(viii) potential for widespread application throughout the animal 
and plant kingdoms 

15 (ix) a robust performance with precision, accuracy and fidelity. 

None of the techniques that are currently available fulfil the 
majority of these ideal characteristics. All are compromised by at least one 
of several limitations including: expense; lack of speed; requirement for 
large amounts of DNA; low polymorphism detection rate; an inability to 

20 detect small sequence variations such as point mutations; a lack of fidelity 
with high incidence of artefacts and spurious results; inability to analyse 
several complex genomes concomitantly; an inability to resolve 
simultaneously polymorphisms at multiple loci; an intrinsic need for closely 
related genomes for analysis; a need for prior knowledge of sequence; and 

25 complexity of analysis with a need for expensive equipment and computer 
software. In addition, those techniques that are reliant on large families of 
closely related individuals are further compromised where there are 
discrepancies in lineage, so that paternity testing may be an essential 
preliminary investigation to establish the integrity of each family individual 

30 subject to analysis. 
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The Invention 

The invention is a novel method for generating en masse the 
VNTRs from genomic or synthetic DNA, while preserving each allele with 
its flanking sequence. These alleles may be used to produce a Trngerprint' 

5 by gel electrophoresis, or they may be used as the starting material in 

protocols for genotyping individuals or protocols for isolation of polymorphic 
markers that co-segregate with hereditary traits. The latter may be 
achieved by mis-match discrimination to yield a pool of alleles that are 
common to all individuals manifesting a particular trait Further mis-match 

10 discrimination of these selected alleles with the alleles of individuals in 
which the trait is not present, in solution or fixed to an array, allows 
purification of VNTRs with alleles that are both linked and informative for 
the particular trait The end products, therefore, are designated a Total 
Representation of Alleles Informative for a Jrait (TRAIT). 

15 In one aspect the invention provides a method of making a 

mixture of VNTR alleles and their flanking regions of the genomic DNA of 
one or more members of a species of interest, which method comprises the 
steps of: 

a) dividing genomic DNA of the species of interest into 
20 fragments, 

b) ligating to each end of each fragment an adapter thereby 
forming a mixture of adapter-terminated fragments in which each 3'-end is 
blocked to prevent enzymatic chain extension, 

c) using a portion of the mixture of adapter-terminated 

25 fragments as templates with an adapter primer and a VNTR primer to 
create a mixture of 5 -flanking VNTR amplimers, 

d) using a portion of the mixture of adapter-terminated 
fragments as templates with an adapter primer and a VNTR antisense 
primer to create a mixture of 3'-flanking VNTR amplimers, 

30 e) and using genomic DNA of the one or more members of the 

species of interest as template with the mixture of 5'-f!anking VNTR 
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amplimers and the mixture of 3'-flanking VNTR amplimers as primers to 
make the desired mixture of VNTR alleles and their flanking regions. 

The species of interest may be any eukaryotic species from 
the plant and animal kingdoms. Although they do not show repetitive 

5 sequences in quite the same way, prokaryotic species are also envisaged. 
An individual member of a species may be for example a plant or a micro- 
organism or an animal such as a mammal. 

In another aspect the invention provides a portion of genomic 
DNA of one or more members of a species of interest, said portion 

10 consisting essentially of a representative mixture of alleles of a chosen 
VNTR sequence and their flanking regions. 

The term "representative mixture of alleles" does not 
necessarily imply that all of the possible alleles, or even most of these 
possible alleles, of a chosen VNTR sequence are present. Whether a 

15 particular allele is present or not, e.g. in the mixture generated by the 

method defined above, may depend on the nature of a restriction enzyme 
used in step a) and on other factors. 

The invention also provides a portion of genomic DNA of a 
species of interest, said portion consisting essentially of a representative 

20 mixture of 3'-flanking regions of a chosen VNTR sequence, each member 
of the mixture carrying an adapter at its 3'-end, and a representative 
mixture of 5'-flanking regions of a chosen VNTR sequence, each member 
of the mixture carrying an adapter at its 5*-end. 

The invention also provides a method of treating nucleic acids 

25 which consist essentially of a mixture of polymorphic alleles, e.g. of a 
chosen VNTR sequence and their flanking regions, or alternatively a 
mixture generated in some other way such as AFLP, microsatellite-AFLP, 
GMS or RAPD, the mixture being representative of those which manifest a 
trait of interest, which method comprises separating and then re-annealing 

30 strands of the mixture, and 
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separating and discarding any mis-matches. Preferably the method 
comprises the additional step of hybridizing the said mixture with a mixture 
of corresponding polymorphic alleles, e,g. of the chosen VNTR sequence 
and their flanking regions, or alternatively a mixture generated in some 
5 other way such as AFLP, microsatellite-AFLP, GMS or RAPD, which are 
representative of those which do not show the trait of interest, and selecting 
mis-matches to provide a mixture of polymorphic alleles which are 
characteristic of the trait of interest. 

The invention also provides kits comprising protocols and 
10 reagents for performing the methods herein described. 

The salient points of the invention may be represented as 

follows: 

(i) reduction in the complexity of the genome by double positive 
selection of genomic DNA restriction fragments that both ligate to a chosen 

15 adapter and contain a sequence with homology to a chosen primer, 

employing enrichment of such products by PGR, NASBA or other methods; 

(ii) introduction of the selected enriched fragments to a genomic 
template in such a way that allows recreation of the VNTRs with the 
flanking sequences within that template, whilst preserving the allele and 

20 therefore the informativeness of each locus; 

(iii) mis-match discrimination of the generated VNTR alleles to 
remove any spurious products of amplification that occur through miss 
priming events, reaction contamination, and subtle variation in reaction 
conditions; 

25 (iv) selection of only those synthesised VNTRs alleles that are 

common to all individuals manifesting a particular trait or those alleles that 
predominate in such a group of individuals. This is achieved by strand 
dissociation and hybridization, giving rise to mis-match containing 
heterodupiexes of alleles at any locus that differ among the individuals. 

30 These complexes can be rejected by mis-match discrimination. The 
enriched alleles that are common to individuals manifesting the trait or 
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predominate in that group are sufficiently pure to be used as starting 
material in other DNA based studies that utilise polymorphic alleles; 
(v) rejection of those alleles common to all individuals 

manifesting a particular trait or predominating in such a group that are also 

5 common to individuals in which the trait is not present. This is achieved by 
strand dissociation and hybridization of the VNTR alleles that are common 
to individuals manifesting a particular trait of interest or predominating in 
that group with the VNTR alleles of individuals in which the trait is not 
present followed by a further round of mis-match discrimination. In this 

io case mis-match containing heteroduplexes and homoduplexes derived 
from the individuals manifesting the hereditary trait are selected. These 
represent polymorphic VNTRs with an informative allele that co-segregates 
with the particular trait of interest. Amplification of these VNTRs from the 
DNAs of individuals manifesting the trait of interest yields the informative 

15 alleles that may be used as DNA markers. 

The invention provides a method of selecting genetic 
elements that are common to one pool of individuals but are absent in a 
second or present at a lower level. An obvious variation on this theme is 
the selection of genetic elements that are absent in one pool of individuals 

20 but are present in a second by judicious selection, during the course of the 
procedure, of allele duplexes that are either with or without a mis-match. 

For simplicity, the protocol may be considered in three 
separate sections: generation of VNTR alleles; mis-match discrimination; 
and selection of alleles informative for a trait. The text is illustrated with a 

25 number of diagrams to facilitate description of the invention. 

Generation of VNTR alleles 

The protocol describes a method of generating with fidelity 
the VNTR alleles with their flanking sequences en masse from the genomic 
30 DNA of one individual, or the pooled DNAs of several individuals. The 
initial step involves fragmentation of the genomic DNA physically, 
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chemically or enzymaticaliy. the aim of which is to obtain genomic 
fragments that contain VNTRs ail of which being of an ampiifiable length. 
The use of one or more restriction enzymes gives rise to uniform 
fragmentation of the genomic sample and constitutes the preferred 

5 technique. With judicious choice of restriction enzymes that cut frequently 
there is potential for generation en masse of every VNTR of the chosen 
type within a genome or pool of genomes since virtually ali fragments will 
be sufficiently small for efficient amplification, it should be noted that the 
phenotype of individuals contributing genomic DNA for this fragmentation is 

10 unimportant, indeed, the genomes restricted in this way need not be 

derived from any individual, or pool of individuals, that have been selected 
by virtue of their phenotype for investigation of a particular trait of interest. 



Genomic DNA of one or more individuals of the species ofinterest containing VNTRs ( ^^^ ) 



I Restriction enzyme<s) 



15 

The restriction fragments are tigated to an adapter by which 
the fragments may be amplified or manipulated. The sequence of the 
longer oligonucleotide contained within the adapter is chosen such that it 
fails to generate any products when added as the primer to an amplification 

20 reaction containing genomic DNA as template. Termini are introduced 
physically, chemically or enzymaticaliy to ail available 3' ends to prevent 
their extension under the influence of a DNA polymerase. They may be 
introduced in one of several ways including: (A) addition of the terminus 
prior to ligation; (B) addition of the terminus following ligation; (C) addition 

25 of the terminus during ligation. The spectrum of available termini that are 
suitable for this purpose include, but are not limited to, dideoxy nucleotide 
triphosphates. 
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(A) A method by which termination may be achieved of ail 3' ends 

with dideoxynucieotide triphosphates prior to ligation is through the action 
of a DNA polymerase, including Terminal deoxynucieotidyl transferase, in 
the presence of a chosen dideoxynucieotide triphosphate. 











ysomsem. 





Terminal deoxynucieotidyl 
transferase and a ddNTP 



^^^^ 



Ligation then follows with an adapter containing an 
appropriate 5' recess that accommodates the dideoxynucieotide 
io triphosphate terminus on each strand. 



1 
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I 14 DNA ligase 
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(B) A method by which termination may be achieved of all 3' ends 

15 with dideoxynucieotide triphosphates following ligation is through the action 
of a DNA polymerase in the presence of a chosen dideoxynucieotide 
triphosphate. 
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T4 DNA ligase 



DNA poiym erase + a ddNTP 
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IHH 


ddt 







(C) A method by which the ligated 3* ends can achieve 

termination during the ligation process is through incorporation of a suitable 

5 3' terminus and a 5' phosphate on the shorter oligonucleotide during its 
synthesis such that this oligonucleotide will form a covalent bond with the 
genomic fragments under the influence of a enzyme such as T4 DNA 
ligase. Again, suitable termini include but are not limited to 
dideoxynucieotide phosphates, there being a variety of other modifications 

10 and deoxynucleotide analogues that will prevent extension of the 3' ends 
under the influence of a DNA polymerase. 



ddfH 



T4 DNA ligase 



15 

Of these, method (A) was found to be the most reliable since 
every genomic fragment that achieves ligation to an adapter is guaranteed 
to have an appropriate terminus. In addition, it guarantees that inter- 
fragment ligation is impossible. Method (C) also guarantees that each 
20 ligated 3' end possesses a terminus. However, unlike in the case of 
method (A), inter-fragment ligation can occur. 
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Since it is likely that some fragments will contain sites at 
which one DNA strand is nicked, in order to prevent polymerisation from 
these sites it is preferable to incorporate into them suitable termini. This 
may be achieved in a number of ways including, but not limited to, the 

5 incubation of the terminated and ligated genomic fragments with a DNA 
polymerase in the presence of ail dideoxynucieotide triphosphates. 

The longer oligonucleotide that is contained within the 
adapter may be used as the adapter primer in amplification reactions 
containing the genomic fragments that have been appropriately ligated and 

10 blocked by addition of termini at all potential sites of polymerisation. 
However, in the absence of 'internal' priming from another nucleotide 
sequence, the amplification of DNA is impossible. However, if another 
nucleotide sequence successfully anneals and achieves polymerisation to 
the limit of the adapter, an adapter primer binding site is created. Binding 

15 of the adapter primer will allow polymerisation of DNA to the limit of the 
annealed nucleotide sequence. If the nucleotide sequence represents a 
primer, or represents a nucleotide sequence containing a primer binding 
site, introduction of the adapter primer and the 'internal primer" allows 
specific exponential amplification of products only from those fragments 

20 that successfully ligated to the adapter and contain DNA homologous to 
that of the annealed nucleotide sequence. 

If an oiigonucieotide with sequence homology to a chosen 
VNTR is used as the internal primer, only those fragments that have ligated 
successfully to the adapter and contain the targeted VNTR will be capable 

25 of amplification. This gives rise to 'amplimers' that flank each VNTR, 
comprising genomic sequence limited by a restriction site for the chosen 
restriction enzyme and VNTR sequence with homology to the chosen 
VNTR primer. 
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Annealing of a VNTR primer ( ) 



ddC 



□dd 



Extension of the annealed primer 
by a ONA polymerase creates 
an adapter primer binding site 



□dd 



ddEI 



1 



Introduction of the adapter primer 
{ iliy^M— a } and amplification 
generates amplimers that 
constitute one flank of all VNTRs 



A number of different types of VNTR sequence have been 
identified in a diverse range of species. Tliese include, amongst otfiers, 
the dinucleotide repeats, trinucleotide repeats and the tetranucleotide 

10 repeats. Since the (AC)n dinucleotide repeat constitutes the most common 
VNTR that occurs in the majority of species, primers of appropriate 
sequence to generate ampiimers for this VNTR may be chosen. It can be 
seen that the introduction of an (AC)n primer will give rise to amplimers that 
represent one flank of the VNTRs, and introduction of a (GT)n primer will 

15 give rise to amplimers that represent the other flanl< of these VNTRs. 
However, VNTRs with long repeat lengths will be over represented in the 
amplimer pool relative to shorter VNTRs by virtue of their greater number of 
primer binding sites. Similarly, the longer alleles will be over represented 
relative to the shorter alleles of the same VNTR due to their greater number 

20 of primer binding sites. This problem is negated by the introduction of 
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degenerate 3' ends on the VNTR primers that prevent polymerisation of the 
annealed primers unless they are aligned with the start: of the flanking 
sequence. The amplification of all VNTRs and all alleles, therefore, will not 
be biased by their repeat lengths. In the case of (AC)n dinucleotide 
5 repeats the following primers may be used: 

(AC)nB, where B = C + G + T 
(CA)nD, where D A + G + T 
(GT)nH, where H = A + C + T 
10 (TG)nV, where V = A + C + G 

Alternatively, amplimers of other VNTR sequences may be 
generated in this manner by introduction of the appropriate target-specific 
primer containing a degenerate 3' end. Indeed, amplimers constituting 

15 genomic sequence that contain or fiank any target-specific nucleotide 
binding site may be generated in the same way. 

In the case of (AC)n dinucleotide repeats, the amplimers 
derived from reactions primed by the (AC)nB and (CA)nD degenerate 
oligonucleotides may be pooled. An obvious alternative is to generate an 

20 amplimer pool by priming amplification reactions with the (AC)nB and 

(CA)nD degenerate oligonucleotides together. However, this is likely to be 
less efficient than performing the reactions separately. Similarly, the 
(GT)nH and (TG)nV primed reactions may be pooled, or reactions 
containing both of these degenerate primers may be performed. Thus, two 

25 amplimer pools may be created, each representing sequences from only 
one flank of each VNTR. 

Amplimers constrtuting the 5' fiank of all VNTRs Ampiimers constrtuting the 3' flank of all VNTRs 

— ^ b\\^l 1 I 
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Since only one of the two flanking sequences of all VNTRs is 
generated in each amplimer pool, the full allele length being absent, the 
products of amplification are non-informative. However, the full length 
alleles, together with their flanking sequences, can be recreated with fidelity 
5 en masse from genomic DNA by hybridisation of the amplimers to that 
genomic DNA and subsequent polymerisation of the annealed sequences. 
As such, the full length ^affected' VNTR alleles of individuals manifesting a 
particular trait of interest may be obtained by hybridisation of the amplimers 
to the genomic DNAs of those individuals. Similarly, the reciprocal reaction 
10 for individuals in which that trait is absent will give rise to the generation of 
full length 'wild type' VNTR alleles and flanking sequences as they occur in 
the genomes of those individuals. Thus, two pools of VNTRs can be 
generated containing alleles derived from 'affected' DNA and alleles 
derived from 'wild type' DNA. A DNA polymerase that is highly processive 
15 is preferred in this application in order to minimise the potential for 
generation of 'stutter bands' that result from strand slippage during 
polymerisation. 

To limit the potential for generation of spurious products by 
'cross-talk' that occurs through the non-specific association of amplimer 
20 strands during hybridisation, it is preferable to remove the VNTR repeat 
sequences from the amplimers since these repeat sequences will be 
responsible for the majority of such cross-talk. This may be initiated in a 
number of ways including, but not limited to, (A) digestion by an enzyme 
with 3' to 5' exonuclease activity; (B) digestion by an enzyme with 5' to 3' 
25 exonuclease activity ; (C) digestion by Uracil DNA glycosylase of an 
amplimer pool generated with primers containing uracil; (D) digestion by 
RNase of an amplimer pool generated with an RNA primer. 
(A) Providing the 5' end of the adapter primer has all four 

nucleotides represented the opposing strand will be similarly endowed. As 
30 such, incubation with an enzyme with 3' to 5' exonuclease activity, such as 
T4 DNA polymerase at 12°C in the presence of only two deoxynucleotide 
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triphosphates, will not lead to significant shortening of the 3' strand 
complementing the adapter primer. The 3* strand complementing the 
VNTR primer, however, will be removed by T4 DNA polymerase if the 
reaction occurs in the presence of the deoxynucleotides that it lacks. 

5 Exonuclease digestion by the enzyme will cease when the first 

deoxynucleotide that is present in the reaction mixture is encountered. The 
5' overhang that is created may be digested with a single strand specific 
exonuclease or endonuciease, including but not limited to Exonuclease VII, 
such that all repeat sequence is removed. The illustration depicts a 

10 scenario for (AC)n and (GT)n primed amplimers: 



□ (AC)n 3' 5' (AC)ns 
3{TG)n 5' 3' 0"G)n[ 



I T4 DNA poiyme 
+ ^ dGTP + dTTP 



rase 



T4 DNA polymerase 
+ dATP + dCTP + 



5' (AC)nii 



Exonuclease Vli 



Exonuclease VII 



15 If a trinucleotide VNTR has been targeted appropriate 

digestion by T4 DNA polymerase in the presence of only one 
deoxynucleotide will be required. For tetranucleotide repeats this method 
is inappropriate and another should be adopted. 
(B) The repeat sequence may be digested with a 5' to 3' 

20 exonuclease, such as T7 gene 6 exonuclease. Phosphorothioate bonds 
retard the activity of this enzyme. Four successive bonds are believed to 
inhibitory. Therefore, if the adapter primer has been synthesised with at 
least four phosphorothioate bonds at its 5' end, if not synthesised 
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completeiy with phosphorothioate bonds, it wil! be resistant to the 5' to 3' 
exonuciease activity of T7 gene 6 exonuclease. If the VNTR primers are 
synthesised with four phosphorothioate bonds at their 3' ends, the action of 
T7 gene 6 exonuclease will digest the VNTR primer leaving four 

5 nucleotides of repeat sequence. The complementary sequence may be 
digested by a single strand specific exonuclease or endonuclease, 
including but not limited to Exonuclease 1, such that all repeat sequence is 
removed from the amplimers apart from four nucleotides in each strand. 
Such a short length of repeat sequence is unlikely to invite the generation 

10 of spurious products by non-specific interaction of strand ends during 
hybridisation. 



3' 



5' 



T7 gene 6 
exonuclease 



T7 gene 6 
exonuclease 



3' 



3' 



T7 gene 6 
exonuclease 



5' 



3' 



T7 gene 6 
exonuclease 



(C) Synthesis of uracil containing VNTR primers, e.g. (GU)nH 

15 and (UG)nV. allows the destruction of these primers in the appropriate 
amplimer pool by the action of Uracil DNA giycosylase. Incubation of the 
digested amplimers with a single strand specific endonuciease, including 
but not limited to S1 nuclease, leads to further digestion of the VNTR 
primers that contains single stranded spaces and ultimately to the removal 
20 of the complementary sequence such that all repeat sequence is removed. 
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](AC)n 3' 
0(UG)n 5' 



Uracil DNA glycosytase 



t i\ ( G)n 5' 



S1 nuclease 



(D) The generation of ampiimer pools with RNA primers based on 

VNTR sequence, using a DNA polymerase with reverse transcriptase 
5 activity, permits the destruction of the VNTR primers by the action of 

RNAse. The complementary sequence may be removed by a single strand 
specific exonuclease or endonuctease. 

There are several methods by which the digested amplimers 
may be hybridised to the genomic DNA of one or more individuals to 
10 generate en masse and with fidelity the VNTR alleles as they occur in that 
template. These include (A) hybridisation and polymerisation of the 
ampiimer pools, either separately in succession or together to genomic 
DNA that may or may not have been fragmented; (B) hybridisation and 
poiymerisation of the amplimers constituting only one flank of each VNTR 
15 to genomic DNA that has been fragmented physically, chemically or 

enzymaticaliy, and then terminated and ligated to an adapter which may or 
may not be the one used to generate the ampiimer pools. In each case, 
the addition of one of many hybridisation accelerators will enhance the rate 
of hybridisation. Particularly under stringent conditions of hybridisation the 
20 use of such accelerators may be preferable. The number of methods by 
which hybridisation may be accelerated is vast but includes the 
incorporation of phenol exclusion, cationic detergents such as cetyl 
trimethylammonium bromide (CTAB), and volume excluding agents such 
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as dextran sulphate. It should be noted that if CTAB is the chosen 
hybridisation accelerator the salt concentrations in the hybridisation mixture 
should be low in order to prevent its precipitation. 
(A) Illustration is given for hybridisation of one amptimer pool to 

5 genomic DNA to permit the reproduction of VNTR alleles in that genomic 
template by a DNA polymerase: 




Hybridisation of the 5' VNTR ampllmersto 
genomic DNA of one or more individuals 




Extension of the annealed 3' end 
by DNA polymerase 



10 



Hybridisation of the second amplimer pool permits 
amplification of all VNTR alleles en masse using the adapter primer: 
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Melting and cooling allows the 3' 
flanking amplimerta anneal to the 
extended strand 



5' 



□ 3' 




IThe VNTR allele and opposing 
flanking sequence is copied by 
DNA polymerase 




1 Amplification of VNTRsfrom the 
genomic DNA under investigation on 
introduction of the adapter primer and 
thermal cycling 
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(B) Illustration is given for hybridisation of one amplimer pool to 

genomic DNA that has been fragmented, terminated and iigated to an 
adapter that may or may not be the same as that as that present in the 
amplimer pools: 




Hybridisation of the 5' VNTR 
amplimers to genomic DNA 
of one or more individuals 



Extension of the annealed 3' end 
by DNA polymerase 



^sssssssssss 



Amplification of the VNTRs from the 
I genomic DNA under investigation 
' on introduction of the adapter primer 

and thermal cycling 



Removal of repeat sequence from the amplimers permits 
10 concomitant Inybridisation of both amplimer pools to genomic DNA while 
limiting the possibility for generation of spurious products through non- 
specific strand association. The generation of spurious products is reduced 
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further by hybridising the amplimers that constitute each flank separately in 
succession. This allows the introduction of further steps to control non- 
specific strand association including the removal of non-hybridised strands 
by incubation with a single strand specific exonuclease or endonuclease 
5 between hybridisations. In the preferred technique only one amplimer pool, 
comprising one flank of each VNTR, is hybridised to terminated and 
adapter-ligated genomic fragments. As such, this negates any possibility of 
non-specific association between amplimer strands of different pools. If 
each amplimer pool is hybridised and polymerised separately in this 
10 manner, the products that are generated in each reaction should be 
identical. Therefore, these products may be combined. 

Hybridisation of the amplimers to the pooled genomes of 
several individuals allows the generation of the VNTR alleles that they 
contain. If this is performed on the pooled genomes of individuals 
15 manifesting a particular trait, and also on those of individuals lacking the 
trait, the 'affected' and 'wild type' alleles that are present in those pooled 
genomes can be synthesised. 

It is preferable to select the affected individuals from a 
defined population such that the same genotype is common to all 
20 individuals of a given phenotype. However, even if these individuals are 
selected from an out-bred population for which there are several genotypes 
that produce a single phenotype, the alleles that co-segregate with the trait 
loci will be present at a higher frequency in the pooled genomes of affected 
individuals than in the reciprocal pooled genomes of wild type individuals. 
25 These alleles will be enriched by successive repetitions of mis-match 

cleavage and amplification. To prevent the allele frequencies from being 
artificially skewed it is preferable to have a large number of individuals 
contributing genomic DNA to each pool. This ensures that the allele 
frequencies in the affected group and wild type group tend to equate to the 
30 general population from which they are derived such that disparity in the 
two is a consequence of linkage disequilibrium with the trait and not 
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another factor. However, if the numbers of affected and wild type 
individuals is limited the selection of matched sibling pairs, one member of 
each pair being affected and the other being a wild type individual, will go 
some distance to balance the allele frequencies of the pooled genomes 
5 other than with respect to the particular trait. 

Mis-nnatch discrimination 

If the VNTR alleles that are generated from the affected 
individuals and the wild type individuals are denatured and allowed to re- 

10 anneal in separate reactions duplex DNA molecules with or without mis- 
matches will result. Due to the VNTR-specific flanking sequences and 
stringent conditions of hybridisation, only alleles that are of the same VNTR 
will re-anneal. Therefore, duplexes possessing mis-matches contain 
alleles of the same VNTR that are of unequal size or they contain spurious 

15 products of amplification. Alleles of similar size that re-anneal will form 
perfect duplexes. 

The molecules that contain a mis-match may be digested with 
an enzyme that acts upon single stranded DNA or an enzyme that is able 
to detect conformational irregularities in DNA. Suitable enzymes include 

20 but are not limited to S1 nuclease and T4 endonuclease VI I. 
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Alleles shared by all individuats 



Alleles that differ among individuats 
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Denature and re-anneal 



1 I 



T4 endonuclease VII 



'mm 
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Of these two enzymes, T4 endonuclease VII has proved to be 
the most reliable and efficient enzyme in this application and has been 
found to digest efficiently in a range of DNA polymerase buffers while 
tolerating carry-over of CTAB from the hybridisation reaction. It cleaves 

10 both strands of a mis-match containing molecule leaving staggered ends, 
each strand being cleaved 3' with respect to the mis-match. 

Cleavage is likely to occur within the repeat sequence 
creating ends that may interact non-specificaliy during the subsequent 
amplification process and resulting in the generation of spurious products. 

15 To obviate this problem the repeat sequences may be digested from the 
cleaved duplexes. This may be achieved in a number of ways, including 
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(A) by the action of a 3' to 5' exonuclease including but not limited to 
Exonuciease ill, togetiier with a single strand specific exonuclease or 
endonuciease, having protected all DNA strands prior to T4 endonuclease 
Vll digestion with protective termini including but not limited to a- 
thiophosphate groups or a 3' overhang; (B) by the action of a 5' to 3' 
exonuclease including but not limited to T7 gene 6 exonuclease, together 
with an exonuclease or endonuclease, having protected all DNA strands 
prior to T4 endonuclease Vll digestion with protective groups including but 
not limited to phosphorothioate bonds incorporated in to the adapter primer. 

By inclusion of phosphorothioate bonds in the adapter primer 
the 5' ends of all molecules containing the adapter primer will be resistant 
to the 5' to 3' exonuclease activity of T7 gene 6 exonuclease. However, the 
5' ends created by T4 endonuclease Vll cleavage will be susceptible to this 
enzyme. 



T7 gene 6 exonuclease + single strand 
specific exonuclease or endonuclease 



1 — i 



Amplification 
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It is possible that some molecuies will escape complete 
cleavage by T4 endonuciease VI! acquiring merely a single stranded nick. 
However, such nicks are susceptible to digestion by T7 gene 6 
exonuclease, though only the nicked strand would be digested if this 
5 enzyme was used in concert was a single strand specific exonuclease. On 
the other hand, a single strand specific endonuciease, including but not 
limited to SI nuclease, would cleave the complementary single strand that 
is exposed by action of T7 gene 6 exonuclease in molecules receiving 
single stranded nicks such that both strands become disrupted. Thus, 
10 enzymes such as S1 nuclease in concert with T7 gene 6 exonuclease 
would lead to the complete digestion of all T4 endonuciease VII digested 
molecules irrespective of whether one or both strands was cut. 

S1 nuclease has proven successful in this role, being capable 
of efficient digestion of single stranded DNA under alkaline conditions 
15 created by the T7 gene 6 exonuclease buffer. However, some non-specific 
digestion of DNA may occur with this enzyme. Since those molecuies 
receiving single stranded nicks by the action of T4 endonuciease Vil are 
likely to be few, it may be preferable to use a single strand specific 
exonuclease that is less likely to act in this way. Among such enzymes are 
20 included Exonuclease I and Exonuclease VI!. Molecules that lack a mis- 
match are resistant to this regime of digestion and may be enriched by 
amplification. In order to minimise the generation of 'stutter bands' that 
result from strand slippage and polymerase errors during the amplification 
reaction, the number of cycles of amplification should not exceed that 
25 which gives adequate yields of product. 

In addition to T7 gene 6 exonuclease, Exonuclease III may 
act at nicks in DNA molecules. In the absence of phosphorothioate bonds 
within the adapter primer this enzyme would create long 3' overhangs in 
nicked molecules on digestion to completion. Therefore, inclusion of a 
30 single strand specific endonuciease or exonuclease that would remove 
these overhangs would allow the elimination of the cleaved molecule 
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irrespective of whether T4 endonuciease Vll disrupted one or both strands 
in a mis-match containing duplex. However, in order to obviate the need 
for the additional step comprising protection of the 3' ends of all DNA 
molecules prior to mis-match cleavage the use of T7 gene 6 exonuciease is 
5 preferred since protection of the 5' ends that is required for use of this 
enzyme is easily achieved by incorporation of phosphorothioate bonds into 
the adapter primer. 

Another method by which cleaved molecules could be 
removed is by addition of a hapten, including but not limited to biotin-16- 
1 0 dUTP, at the sites of cleavage followed by physical separation of the 

cleaved molecules by the affinity of the hapten to another chemical. This 
could be achieved by termination of the 3' ends of all molecules prior to the 
mis-match cleavage procedure such that they are inert in the presence of a 
DNA polymerase. Suitable temriini include but are not limited to 
15 dideoxynucleotide triphosphates which may be incorporated by a DNA 
polymerase including but not limited to Terminal deoxynucleotidyl 
transferase. Subsequent incubation of the cleaved molecules with biotin- 
16-dUTP in the presence of a DNA polymerase, such as Terminal 
deoxynucleotidyl transferase, will give rise to biotinylation of only those 
20 molecules which lack temninated 3' ends. Separation of the biotinylated 
molecules through binding to streptavidin could then follow. 

In a similar manner, since molecules cleaved by T4 
endonuciease Vll have a 3' overhang these molecules could be removed 
through capture by single stranded binding proteins or chemicals that 
25 possess an affinity for single stranded DNA. It is likely that the overhang 
created by T4 endonuciease Vll will be too small for efficient selection of 
the cleaved molecules by this method. However, they could be lengthened 
specifically by incubation with a DNA polymerase, including but not limited 
to Terminal deoxynucleotidyl transferase in the presence of one or more 
30 deoxynucleotide triphosphates, having terminated all 3' ends of the DNA 
molecules prior to mis-match cleavage with suitable termini that render 
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them inert in the presence of a DNA polymerase. 

Physical separation of DNA molecules is cumbersome and 
relatively inefficient compared to separation by enzymatic means. 
Furthermore, the removal of molecules that possess single stranded nicks 
5 is likely to be unsuccessful. For these reasons methods of enzymatic 
differentiation of DNA species is preferred. 

Reiteration of several rounds of denaturation, hybridisation 
and mis-match cleavage successfully eliminates all spurious products of 
amplification. Furthermore, it reduces to homozygosity all VNTRs such that 
10 only the most common allele of each VNTR remains, or it tends to eliminate 
those VNTRs for which many alleles are present with equal frequency. 
Rapid transition from the temperature of denaturation to that of annealing is 
required to prevent preferential annealing of identical sized alleles. This is 
may occur if the transition from the denaturation temperature to the 
15 annealing temperature is protracted. A hybridisation accelerator may be 
included to enhance the efficiency of hybridisation. This process carried 
out in parallel for the 'affected' VNTR alleles as well as the 'wild type' VNTR 
alleles will tend to achieve identical reduction to homozygosity and the 
generation of balanced allele frequencies. However, for a number of 
20 VNTRs the allele frequencies in the affected and wild type groups at the 
end of the mis-match cleavage procedure will be significantly different. 
Providing that the trait of interest is the only feature distinguishing the two 
groups of individuals from which the VNTRs were derived alleles that are 
over represented in the affected group relative to the wild type group must 
25 co-segregate with that trait. These are markers of the trait and should be 
selected. 

The effect of reiterated mis-match cleavage on the allele 
frequencies of a VNTR can be illustrated with a basic scenario ignoring the 
efficiency of digestion, the effects of polymerase errors and the second 
30 order kinetics of hybridisation. Consider a VNTR for which three alleles are 
present as follows: 
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STARTING SCENARIO 



Alleles ABC 

Allele frequency '^U 

Ratio 2 1 1 



if the alleles are denatured and allowed to re-anneal duplex 
molecules with or without a mismatch will result. The proportion of each 
allele that forms a perfect duplex will depend on its allele frequency. All 
10 mis-match containing molecules theoretically would be susceptible to 

digestion by T4 endonuclease Vil and would be eliminated. Thus, after the 
first round of mis-match cleavage the amounts and ratios of each allele 
remaining would be: 



15 Alleles ABC 
Amount remaining '^^6 ^^e 

Total remaining %6 
Ratio 4 1 1 

Allele frequency "^^e ^^e ^5 

20 

After a second round of mis-match cleavage the allele 
frequencies would change further; 



Alleles 
25 Amount remaining 
Total remaining 
Ratio 

Allele frequency 



ABC 



16 1 1 



30 



After the 3rd round the theoretical allele frequencies would be 

as follows: 



wo 98/42867 



-36- 



PCT/GB98/00840 



Alleles ABC 

Amount remaining ^^^^324 ^^324 ^ 324 

Total remaining ^^%4 

Ratio 256 1 1 

Allele frequency ^^^^253 ^^258 ^^258 



Therefore, after two rounds one allele wouid predominate 
markedly. After a further round this allele would be present virtually 
exclusively. The ratio of the total amount of this VNTR remaining, relative 
10 to a VNTR for which there was only one allele prior to mis-match cleavage, 
would be: 

6/ 18/ y 2587 . 1/^ y 1/^ y 1/^ 

16 X 36 X 324 • 1 ^ 1 X i 

-43/ . i 
- 288 ' ' 

15 In the same way the most common allele of any VNTR will 

predominate after a sufficient number of rounds of mis-match cleavage. 
Four rounds may be sufficient to reduce the VNTRs to near homozygosity, 
but the efficiency of enzyme digestion, the generation of polymerase errors 
and the kinetics of hybridisation are factors that will influence this. Disparity 

20 in the allele frequencies of affected and wild type VNTRs will lead to 

enrichment of different alleles in each group if the imbalance is sufficiently 
large. Such alleles are informative for the trait of interest but must be 
selected from other enriched alleles that may be identical in both the 
affected and wild type groups if these predominate in the population in 

25 general irrespective of the trait. 

Further examples of mis-match discrimination under different 
scenarios is given in the Appendix. 



30 



Selection of alleles informative for a trait 

Selection of the alleles linked to the trait of interest may be 
achieved in a number of ways. Disparity in the allele size of each VNTR 
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surviving successive rounds of the mis-match cleavage procedure may be 
identified by hybridisation of these alleles from each group of individuals to 
an array of VNTR alleles of known length and spatial separation such that 
differences can.be detected. Indeed, it may be possible to achieve 
5 quantitative hybridisation to an array in a similar manner that generates 
information regarding allele frequencies in the two groups without need of 
the mis-match cleavage procedure. 

A less elaborate procedure involves the subtraction of the 
alleles in one group from those in another to identify differences in allele 
10 frequencies. However, this method must identify not only a VNTR for 
which an allele is present in one group but no alleles survive in the other 
group, but also a VNTR for which the alleles surviving in each group are 
different since both of these scenarios suggest linkage disequilibrium with 
the trait of interest. This can be achieved physically, chemically or 
15 enzymatically. If enzyme based selection is chosen it is preferable to 
amplify the alleles that have been enriched by the mis-match cleavage 
procedure with adapter primers that lack phosphorothioate bonds in order 
that enzyme digestion can proceed to completion. 

A suitable method of enzyme based selection involves the 
20 addition of protective termini, including but not limited to a 3' overhang of at 
least four nucleotides or an a-thiophosphate linkage, to the surviving alleles 
of one group of individuals and subtraction with an excess of those 
sun/iving from the other group using Exonuciease III. Under most 
circumstances identification is required of any allele surviving from the 
25 affected individuals that fails to survive from those individuals lacking that 
trait. For this, addition of the protective termini should added only to the 
VNTRs derived from affected individuals. Obviously, the alternative 
strategy is possible. A 3' overhang may be created in a number of ways 
including but not limited to (A) ligation of an adapter, or by (B) non-template 
30 addition of nucleotides by a DNA polymerase. Of these, method (B) was 
found to be the more efficient which may be achieved using an enzyme 
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such as Terminal deoxynucleotidyl transferase. This enzyme may 
generate a 3' overhang of several hundred nucteotides on incubation in the 
presence of a single deoxynucieotide triphosphate. An a-thiophosphate 
linkage may be incorporated by addition of a protective deoxynucieotide 

5 analogue using a DNA polymerase including but not limited to Terminal 
deoxynucleotidyl transferase. Suitable analogues include a-thio 
deoxynucieotide triphosphates. Since these analogues may inhibit 
subsequent digestion or manipulation of the DNA molecules the addition of 
a 3' overhang to impart protection is preferred. Another less preferred 

10 method of imparting protection to the activity of Exonuclease 111 is through 
the action of an exonuclease with 5' to 3' activity, including but not limited 
to T7 gene 6 exonuclease, that may create a 5' recess in duplex DNA. The 
appropriate incorporation of phosphorothioate bonds within the adapter 
primer that is used to amplify the DNA molecules would ensure that 

15 digestion by T7 gene 6 exonuclease beyond that required to impart: 

resistance to Exonuclease ill is prevented. Simiiariy, a 5' recess could be 
created by incorporation of a uracil rich 5' end in the adapter primer which 
could be digested using an enzyme such Uracil DNA giycosylase. 



B 



EH3 m^.. 



Ligation of alleles 
from the affected pool 
to a second adapter 



dATP + Terminal 
deoxynucleotidyl 
transferase 



20 

The resulting molecules are resistant to Exonuclease 111 
digestion because of the 3' overhang that is created. Hybridisation to an 
excess of the surviving wild type alleles ensures heteroduplex formation of 
25 all affected alleles providing an allele of the appropriate VNTR survives in 
the wild type group. 
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If there are no wild type alleles to subtract from those of the 
5 affected group homoduplex molecules that possess a 3' overhang at each 
end wiil result (molecule 1). If the surviving allele of a VNTR differs 
between the two groups a heteroduplex molecule containing a mis-match 
will result (molecule 2). Surviving alleles of equal size in the two groups will 
give rise to heteroduplex molecules without a mis-match (molecule 3). The 
10 other species of DNA that will result from the hybridisation include 

homodupiexes of wild type alleles that may or may not contain a mis-match 
(molecule 4) and single stranded molecules that fail to hybridise. Digestion 
of these different types of molecule by an enzyme that acts on single 
stranded DNA or conformational irregularities in DNA, including but not 
15 limited to T4 endonuclease VII, results in cleavage of those duplexes 
containing a mis-match with the generation of a 3' overhang at the site of 
cleavage. 
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4 

I i v/zr^^cm -ma 



The subsequent digestion by Exonuciease 111 renders single 
5 stranded ait duplexes or fragments of duplexes that do not possess a 3' 
overhang at each end. 



1 2 




Since the digestion of susceptible molecules by Exonuciease 
111 tends to go to completion further digestion with a single strand specific 
exonuciease or endonuclease eliminates all single stranded DNA species 
and removes the 3' overhang on the surviving molecules. Therefore, only 
the target molecules survive digestion. Exonuciease 1 is suited to this task 
but often leaves a single nucleotide 3' overhang that must be removed if 
blunt end cloning is chosen as the means by v\/hich the target molecules 
are recovered. 
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For the intact homoduplexes the informative allele is present 
within the homoduplex and may be identified by cloning and sequencing. 
For T4 endonuclease VII cleaved fragments that have survived digestion by 

5 Exonuclease 111 and Exonuciease t, the full length VNTRs can be obtained 
by hybridisation of the fragments to fragmented, terminated adapter-ligated 
genomic DNA followed by amplification in a similar manner to that 
previously described. The informative allele may be identified by 
genotyping the individuals manifesting the trait of interest with respect to 

10 these VNTRs using VNTR-specific primers designed from their flanking 
sequences. 

It is obvious that this method of subtraction is equally suited 
to other alleles besides those of VNTRs that may be generated in a variety 
of different ways. As such, this method of identifying differences in the 

15 composition of DNA pools may be applied more widely for selection of 

other types of polymorphic sequences as well as other species of DNA that 
may be present in one pool but absent in the same form in another. 

This method is unique in its suitability for investigation of 
polygenic as well as monogenic hereditary traits. It is likely to make a 

20 significant impact in the study of hereditary traits, reducing considerably the 
difficulty, time and expense that is currently associated with this field of 
research. 

The preferred embodiment 

25 (i) Fragmentation of genomic DNA of an individual of the species 

under investigation, but not necessarily an individual in that investigation, 
with a single restriction enzyme. 
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(ii) Termination of all 3' ends by Terminal deoxynucleotidyl 
transferase in the presence of a dideoxynucleotide triphosphate. 

(iii) Ligation of the terminated fragments to an adapter by 
incubation in the presence of T4 DNA ligase, followed by termination of 

5 single-stranded nicks. 

(iv) Purification of the ligated products from the ddNTPs and 
amplification in reactions containing: 

a) adapter primer and an (AC)nB primer, where B=G+T+C; 

b) adapter primer and a (CA)nD primer, where D=G+A+T; 
10 c) adapter primer and a (GT)nH primer, where H=A+T+C; 

d) adapter primer and a (TG)nV primer, where V=G+A+C. 

The products of amplification result from genomic fragments 
that successfully ligate to the chosen adapter and contain a VNTR with 
homology to the chosen primer. 
15 (v) Digestion of the (AC)nB and (CA)nD primed products by T4 

DNA polymerase in the presence of dATP and dCTP, followed by 
Exonuclease VI! to remove all VNTR sequences and excess VNTR primer. 

(vi) Digestion of the (GT)nH and (TG)nV primed products by T4 
DNA polymerase in the presence of dGTP and dTTP, followed by 

20 Exonuclease VII to remove all VNTR sequences and excess VNTR primer. 
Size selection may be performed to obtain products of an optimal range of 
molecular weights. 

(vii) Hybridization of an excess of either the combined (AC)nB 
and (CA)nD primed products or the combined (GT)nH and (TG)nV primed 

25 products with a sufficient amount of genomic DNAs derived from individuals 
manifesting a particular trait of interest. 

(viii) Incubation of the hybridized products with Taq DNA 
polymerase to achieve strand extension of all annealed 3' ends. 

(ix) Addition of adapter primer and generation of VNTR alleles 

30 from the 'genomic template' by themnal cycling in the presence of Taq DNA 
polymerase. 
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(X) Purification of the generated VNTR alleles followed by strand 

dissociation and reannealing under stringent conditions. 

(xi) Digestion with T4 endonuclease VII of mis-match containing 
duplex molecules that result from hybridization of VNTR alleles to spurious 

5 products of amplification, or hybridization of VNTR alleles that differ among 
the individuals under investigation manifesting a particular trait of interest. 

(xii) Further digestion by T7 gene 6 exonuctease together with S1 
nuclease to remove VNTR sequence from cleaved molecules or eliminate 
them completely. 

10 (xiii) Amplification of the surviving DNA molecules by thermal 

cycling in the presence of Taq DNA polymerase. 

(xiv) Repetition of hybridization, digestion and amplification of the 
surviving DNA molecules. This enriches the reaction in VNTR alleles that 
are common to all individuals manifesting the particular trait of interest or 

15 those alleles that predominate in such a group and removes any spurious 
products of amplification. 

(xv) Addition of a 3' overhang to the selected alleles of the group 
of individuals manifesting a particular trait by incubation with Terminal 
deoxynucleotidyl transferase in the presence of a dNTP. 

20 (xvi) Hybridization of the selected VNTR alleles of the group of 

individuals manifesting a particular trait that possess a 3' overhang to an 
excess of the VNTR alleles of individuals in which the trait is absent that 
have been generated from their genomic DNAs in a method bearing 
similarity, wholly or in part, with (i) to(xiv). 

25 (xvii) Digestion of mis-match containing duplex molecules by T4 

endonuclease VII. 

(xviii) Further digestion by Exonuciease III to eliminate strands in 
duplex molecules that lack protection by a 3' overhang. 

(xix) Further digestion, after removal or inactivation of the 

30 Exonuciease 111, by Exonuciease I to remove single stranded DNA. This 
results in elimination of all molecules other than the VNTRs linked to the 
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particular trait For intact VNTRs the informative allele is present. For 
cleaved VNTRs that survive digestion by Exonuciease Ml and 
Exonuclease I the entire VNTR sequence may be obtained after 
hybridisation to fragmented, terminated, adapter-ligated genomic DNA and 
5 strand extension by Taq DNA polymerase such that VNTR specific primers 
may be designed from the flanking sequences that allow genotyping of 
affected individuals to implicate the informative allele linked to the trait. 

A second embodiment 

10 (i) VNTR alleles are generated by means other than processes 

of amplification of fragmented and ligated genomic DNA with adapter 
primer and VNTR primer, hybridization of the generated products to 
genomic 'template' DNAs of individuals manifesting a particular trait, and 
generation of the respective VNTR alleles from those template DNAs. 

15 These may include but are not limited to: 

a) amplification of VNTRs from genomic or synthetic DNA using 
primers specific to the flanking regions of each VNTR in individual 
reactions; 

b) amplification of VNTRs from genomic or synthetic DNA using a 
20 multiplex system, thereby allowing amplification of multiple VNTRs en 

masse using adapted VNTR specific primers; 

c) amplification of VNTRs from genomic or synthetic DNA using an 
endonuclease that cleaves in or about VNTR sequences such that 
adapters may be ligated to the digested DNA and used for amplification of 

25 the VNTR alleles; 

d) generation of a pool of VNTRs from individuals manifesting a 
particular trait by processes of subtraction with those in which the trait is 
absent. 

(ii) Purification of the generated VNTR alleles followed by strand 
30 dissociation and reannealing under stringent conditions. 

(iii) Digestion with T4 endonuclease VII of mis-match containing 
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dupiexes that result from hybridization of VNTR alleles to spurious products 
of amplification, or hybridization of VNTR alleles that differ among the 
individuals under investigation manifesting a particular trait of interest. 

(iv) Incubation of the hybridized alleies in the presence of T7 

5 gene 6 exonuclease and S1 nuclease such that the digested duplex DNA 
molecules and single stranded DNA species are eliminated. 

(v) Enrichment by amplification of mis-match free duplexes that 
are resistant to digestion, 

(vi) Repetition of hybridization, digestion and selection of mis- 

10 match free molecules. This enriches the reaction in VNTR alleles that are 
common to all manifesting the particular trait of interest and removes any 
spurious products of amplification. 

(vii) Hybridization of the selected VNTR alleles, that are common 
to all individuals manifesting a particular trait, to the VNTR alleles of 

15 individuals in which the trait is absent that have been generated from their 
genomic DNAs in a method bearing similarity, wholly or in part, with 
(i) to (vi). 

(viii) Digestion with T4 endonuclease VII of mis-match containing 
duplexes followed by successive incubation with Exonuclease 111 and 

20 Exonuclease I. 

(ix) Selection from the mixture of those surviving molecules that 
lack a 5' overhang. These entire VNTRs or VNTR fragments are linked to 
the particular trait of interest. The informative allele, with respect to the trait 
of interest, of the entire VNTRs can be established by sequencing. For the 

25 VNTR fragments the full length sequence can be generated by 

hybridisation to fragmented, terminated and adapter-ligated genomic DNA 
followed by incubation with Taq DNA polymerase. The informative allele 
may be established by various methods including but not limited to 
genotyping individuals manifesting the trait of interest using VNTR-specific 

30 primers designed from the flanking sequences. 

Those that are skilled in the art will appreciate that there are 
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several methods of differentiating mis-match containing duplexes from 
those that are free of mis-matches, either in solution or on an array. The 
methods described in the above embodiments represent only one of these 
methods. 

5 Those that are skilled in the art will appreciate that the 

invention is equally well suited any type of VNTR including but not 
restricted to dinucleotide repeats e,g.(CA)n and (GT)n, trinucleotide repeats 
e.g,(AAT)n, (AGC)n, (AGG)n, (CAC)n, (CCG)n and (CTT)n, and 
tetranucieotide repeats e.g.(CCTA)n, (CTGT)n, (CTTT)n.(TAGG)n, 
10 (TCTA)n, and (TTCC)n. In addition, the invention may be applied to simple 
organism microsatellites that include, but are not limited to, (AT), (CC), 
(CT) and (GA) rich tracts of repetitive motifs. 

Those that are skilled in the art will appreciate that 
polymorphic alleles, other than those of VNTRs, may be used with the 
15 invention to produce alleles that are free of spurious products of 

amplification and are common to all individuals manifesting a particular 
trait. These polymorphic alleles may be hybridized to a fixed array of all 
possible alleles, or subset thereof, or to a pool of alleles derived from 
individuals in which that trait is absent. By mis-match discrimination those 
20 alleles linked and informative for a trait can be identified. 

Those that are skilled in the art will appreciate that alleles 
from the genome of a single individual, or more than one individual, of 
unknown phenotype and genotype may be amplified with fidelity, removing 
the spurious products of amplification by mis-match discrimination, and 
25 hybridized to a fixed array of alleles, or to a pool of alleles in solution, in 
order assign a genotype or a phenotype to that individual. 

Those that are skilled in the art will appreciate that mis-match 
discrimination may be performed using enzymes or chemicals other T4 
endonuclease Vll. These alternatives include but are not limited to S1 
30 nuclease, Mung Bean nuclease, mutation detection proteins (e.g. Mut S), 
osmium tetroxide and hydroxylamine. 
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Those that are skilled in the art will appreciate that the 
polymorphic sequences that are amplified are themselves valuable and 
may be used in protocols other than that which determines co-segregation 
of VNTRs with a hereditary trait including but not limited to genotyping, 
5 mapping, positional cloning, quantification of trait loci, studies of ancestry 
and evolution, population studies, phyiogenetics, and the study in vitro as 
we!! as in vivo of VNTRs and the sequences that separate them. 

Those that are skilled in the art will appreciate that the invention 
may be used to identify somatic mutations that are non-hereditary if a 
10 VNTR is involved in that mutation. 

Those that are skilled in the art will appreciate that the 
terminated and adapter-ligated genomic fragments may be used to 
recreate or amplify that region of the genome with sequence homology to 
any nucleotide sequence known or unknown to which they are hybridised. 
15 Those that are skilled in the art will appreciate that the 

method represents a means of purifying a consensus sequence from PGR 
products such that the spurious products of amplification are eliminated. 

Those that are skilled in the art will appreciate that the 
method represents a means of purifying a consensus sequence from any 
20 pool of one or more types of DNA molecule. 

The invention differs fundamentally from all previous 
techniques since genomic fragments are generated that do not reflect the 
polymorphic variation at the locus from which they were derived. 
Furthermore, these fragments need not be generated from an individual in 
25 a particular investigation, but may be from any individual of the appropriate 
species. However, hybridization of these fragments to genomic 'template' 
DNA of an individual subject to investigation and mis-match discrimination 
permits amplification, with fidelity, of alleles within that genomic template 
whilst overcoming the problems of generation of spurious products that are- 
30 a feature of other PCR-based methods. If the genomic fragments are 
derived from a single individual the problems of polymorphic variation 
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within the sequences that flank each VNTR are negated because these will 
be identical for all individuals under investigation. Since the invention 
preserves each VNTR allele with its flanking sequences, these alleles 
remain highly informative. In this respect the invention is unique. 
5 Furthermore, this novel method of generating VNTRs is rapid, inexpensive, 
has no requirement for prior knowledge of sequence, and has no 
requirement for elaborate equipment, it is of immense importance obviating 
the high investment of time and money that is currently required for 
isolation of VNTRs, Consequently, the application of technologies 
10 dependant on the availability of VNTR in species in which none have been 
isolated will be possible where previously this was unfeasible. The ability 
to generate large numbers of VNTRs from all species quickly, efficiently, 
cheaply and with fidelity is a considerable contribution of the present 
invention to workers in the to the biomedical field. 
15 In summary, the invention involves a novel method of 

generating VNTRs encompassing restriction endonuclease digestion of 
DNA, ligation of the fragments to adapters and, by introduction of a primer 
with sequence homology to a chosen VNTR, amplifying only those 
fragments that are flanked by a chosen endonuclease restriction enzyme 
20 site and a VNTR. These fragments are not representative of the alleles of 
each VNTR and need not be generated from any specific individual under 
investigation. Hybridization of these fragments with genomic DNA of the 
individuals under investigation recreates the intact VNTR alleles with 
flanking sequence, as they occur in the genome. This in itself constitutes a 
25 major step in the ability of workers in the biomedical fields to generate 
quickly, efficiently, cheaply and with fidelity VNTRs in all species for 
purposes reliant on the availability of VNTRs, including but not confined to 
DNA fingerprinting and linkage analysis. The incorporation of a mis-match 
discrimination procedure overcomes the problems of miss-priming and 
30 generation of spurious products by reaction contamination and subtle 
variation in reaction conditions, that are to the detriment of all PCR-based 
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technoiogjes, and allows exclusion of alleles that are not comnnon to all 
individuals under investigation that manifest a particular trait. A second 
round of mis-match discrimination removes un-informative alleles that are 
present in the genomes of individuals that do not manifest the trait This 

5 procedure is designated a Total Representation of Alleles that are 

Informative for a Trait (TRAIT). The invention, therefore, has significant 
advantages over previous methods, embracing the speed of analysis of 
AFLP, GMS, RDA and RAPD, and the high polymorphism detection rate of 
linkage analysis, but negating the need for DNA from closely related 

10 individuals and for paternity testing. The invention also overcomes 
fundamental problems that are a feature of PGR based technologies, 
including miss-priming and generation of spurious products through 
reaction contamination and subtle variations in the conditions of reaction. 
Furthermore, there is no requirement for expensive equipment or elaborate 

15 statistical computer software. The analysis will give rise to alleles that are 
both linked and informative, being present exclusively or at a higher 
frequency in individuals manifesting the trait of interest but absent or 
present at a lower frequency in those individuals that lack the trait. In this 
respect, the invention is unchallenged in its superiority over all other 

20 methods. 

The invention allows concomitant detection of polymorphisms 
at multiple loci by simultaneous comparison of simple or complex genomes 
from multiple individuals and differs fundamentally from all other techniques 
that have been previously employed. The invention represents a major 

25 advance in the ability of workers in the biomedical fields to generate 

VNTRs from the genomes of any species quickly, efficiently, cheaply and 
with fidelity in addition to screening complex genomes for polymorphisms 
co-segregating with hereditary traits. Application of this procedure will 
therefore facilitate the development of markers for genetic screening for 

30 hereditary disease, or advantageous monogenic or polygenic traits in ail 
organisms. 
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Examples of How the invention may be Applied 

The following illustrations represent exampies of how the 
invention may be applied without inferring any limitation to scope of the 
invention or any limitation to the different ways in which the invention may 
5 be applied. 

Experimental Data 

Example 1 

Preparation of ampiimers using {CA)13 and (GU)i3 primers, 

10 2|jg DNA was completely digested with 3^1 Rsa 1 in a total 

volume of lOOpi: 

8.5|j1 genomic DNA (equivalent to 3pg DNA) 

1 0[i\ 1 0x reaction buffer 

3pl Rsa 1 {10u/pl; Promega) 
15 78.5M1 dH,0 

100m1 

The reaction was incubated at SY^'C over night followed by 
heat inactivated by incubation at 70°C for 20 minutes. The DNA was 
separated from the buffer by microconcentration (Microcon-100; Amicon). 
20 A volume of 1 0|jI was recovered. 

2nmoles of 48mer and 2nmotes of 12mer oligonucleotides 
that constitute the adaptor were combined: 

1 5.9pl 48mer (equivalent to 2 nmoles) 

13.7ijl 12mer (equivalent to 2 nmoles) 
25 1 0pl 1 0x ligase buffer (NEB) 

48.4|jl dHoO 

88iJl 

The mixture was heated to 50°C and allowed to cool to 10°C 

over 1 hour. 

30 To the 88pl of annealed adaptor was added the 10^1 of 

digested DNA and ligation of the adaptor to the genomic fragments was 
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performed: 

88[ji annealed adaptor/ ligase buffer (containing ATP) 
10|jIDNA 

2iJl T4 DNA ligase (400 NEBu/tJl) 

5 100|JI 

The reaction was incubated at 16°C over night and then heat 
inactivated by incubation at 70°C for 20 minutes. 

The adaptor-ligated DNA fragments were separated from the 
buffer and non-ligated adaptor by microconcentration (Microcon-100; 
10 Amicon). A volume of 12pl DNA was recovered. 

The adaptor-tigated DNA fragments were incubated with Taq 
DNA polymerase in the presence of dideoxynucleotide triphosphates to 
prevent 3' extension of the adaptor and non-ligated DNA in subsequent 
manipulations: 
15 12|j1 microconcentrated DNA 

3|jl 10x NH4 reaction buffer 
1pl 50mM MgCi2 
1|j| 10mM ddATP 
lMi 10mM ddCTP 
20 lOmMddGTP 

10mM ddTTP 
1pl Taq DNA polymerase (5u/pl; Bioline) 
gyLdHsO 

SOmI 

25 The reaction was incubated at ll^'C for 2 hours. 

The adaptor-ligated DNA with terminated 3' ends was purified 
by phenol/chloroform extraction and microconcentration. The volume 
recovered was made up to 40pl and the concentration of DNA was gauged 
by get electrophoresis. A concentration of 75ng/pl was determined. 

30 (CA) primed ampiimers and (GU) primed ampiimers were 

generated in separate reactions: 
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10|ji 10x NH4 reaction buffer 
8}jl 50nnM MgC!2 
1.5iJI lOmMdNTPs 

1pl adaptor-ligated DNA with terminated 3' ends 
5 4iJl (CA) or (GU) primer (25pmol/iJl) 

73.5mI dH.O 
98mI 

The reaction was overlaid with mineral oil and heated to 95°C 
for 2 minutes, during which time 1pl Taq DNA polymerase (5u/pl; Bioline) 
10 and 2pl adaptor primer (50pmol/iJl) were added. 

Thermal cycling was performed as follows: 95°C for 30 
seconds, then 72°C for 45 seconds for a total of 20 cycles, followed by 
72°C for 5 minutes. 

To the lOOiji of (CA) primed products was added Spl 
15 Exonuciease I (10u/ijl) to remove the remaining (CA) primer. This reaction 
was incubated at 37°C for 30 minutes. 

To the 100|j1 of (GU) primed products was added lOpl Uracil- 
DNA glycosylase (1u/pi; NEB) to digest all uracil incorporated into the PGR 
products. This reaction was incubated at 37°C for 2 hours. Ipl 10mM 
20 dNTPs was added followed by 2|j1 T4 DNA polymerase (5u/|jI; Epicentre 
laboratories) to remove the protruding (OA) strand that complemented the 
digested (GU) sequence. This reaction was incubated at 37°G for 5 
minutes. Both the pools of amplimers were phenol/chloroform extracted 
and microconcentrated (Microcon-100; Amicon). For each pool, the 
25 volume recovered were made up to 500pl, of which 5pl was analysed by 
spectrophotometry to determine the concentration of DNA. 

Equal amounts of (CA) and (GU) primed amplimers were 
hybridized to genomic 'template' DNA of a single individual prior to thermal 
cycling. In order to gauge the optimal ratio of amplimer to genomic 
30 'template' DNA several reactions were performed using various amounts of 
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'template' DNA while keeping the amount of amplimers constant: 

Template" DNA (ng) 0 0.1 1 10 100 1000 

Combined amplimers (ng) 111111 
5 5MNaCI(pl) 0.22 0.22 0.22 0.22 0.22 0.22 

clHjO (pi) To a final volume of 5.55pl 

Each reaction was overlaid with mineral oil and incubated at 
98°C for 5 minutes, after which the temperature was reduced stepwise to 
78°C over 4 hours. 
10 The following was added to each hybridization: 

5pl 10X NH4 reaction buffer 
4pl 50mM MgClj 
0.75pl 10mM dNTPs 
O.Spl adaptor primer (SOpmoi/pl) 
15 34.2pi dHsO 

Each reaction was spun briefly in a microfuge. They were 
heated to 72°C for 2 minutes and 0.5pl Taq DNA polymerase (5u/pi;Bioline) 
was added. The reactions were incubated at 72°C for a further 10 minutes, 
after which the temperature was raised to 95°C for 2 minutes. Thermal 
20 cycling was performed as follows: 95°C for 30 seconds, then 72°C for 1 

minute, for a total of 10 cycles. 

For each reaction lOpI of products amplified for 10 cycles 
were added to 40pl of reaction mix and amplified under the same 
conditions for an additional 22 cycles. 5pl of the ends products of 

25 amplification were run on an agarose gei. The reaction containing lOOng 
genomic 'template' DNA was found to yield the most products of 
amplification, equivalent to a ratio of 100:1 by mass of genomic 'template' 
DNA: amplimer. 

The invention was validated by cloning the products of 

30 amplification. Two colonies of E.coli that had successfully transformed 
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were cultured, from which ptasmids were later harvested. These plasmids 
were sequenced and were found to contain VNTR sequences at the 
multiple cloning sites. 

5 Further experimental data 

For the following experiments canine genomic DNA or cloned 
VNTR alleles amplified from canine genomic DNA were used. The cloned 
alleles were ligated into the Sma1 site of the pUC18 MCS, either side of 
which plasmid specific primers were designed for subsequent amplification 
10 of the plasmid inserts: 



Plasmid specific sense primer Plasmid specific antisense primer 

5'- ATGCCTGC AGGTCGACTCTAGAGGA GGCTCGAGCTTAAGGGATATCACTC -5^ 
GCCAAGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAATTCCCTATAGTGAG 



Hind ill i 1 Pst I I 1 Xba 1 



Sac' 



Sph i Sai I BamH I 

Acc 1 Srna i 

Hinc II Xma I 



Kpn t EcoR I 



All reagents were obtained from Amersham Pharmacia 
15 Biotech, or its subsidiary companies, unless stated otherwise. 

Oligonucleotides were obtained from Genset Corp., France. 
The VNTR primers (AC)11B, (CA)11D, (GT)11H and (TG)11V comprised 
eleven repetitions of the sequence shown in brackets followed by a 
degenerate base were B = C + G + T, D = A + G + T, H = A + C + T, andV 
20 = A + C + G. 



Example 2 

Generation of adapter-ligated, dideoxynucleotide terminated genomic 
fragments with (a) termination preceding adapter ligation and (b) 
25 adapter ligation preceding termination. 

(a) 5\ig canine genomic DNA were fragmented with Hae 111, the 

digestion proceeding to completion over 12 hours at 37°G: 
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4.4|a[ 1.135p,g/|ai genomic DNA 

1 0^1 1 0x restriction buffer 

2^! 10u/^tHaelll 

Ml^i dHaO 
5 lOO^il 

Digestion was confirmed by eiectroplioresis of an aliquot of 
the reaction on a 1% agarose gel stained with ethidium bromide. 

The DNA was extracted (GFX purification column) and eluted 
in 50ia.i 5mM Tris pH8.5, of which 30\x\ was incubated with Terminal 
10 deoxynucleotidyi transferase for 3 hours at Sy'C : 

30^1 DNA 

30|j.l 5x Terminal deoxynucleotidyi transferase buffer 
4.5!al lOmMddGTP 

10p,i 9u/^i Terminal deoxynucleotidyi transferase 
15 TS^^iidHaO 
150^I 

The DNA was separated from low molecular weight solutes 
by microconcentration (Microcon-30; Amicon) with successive additions of 
dHaO between episodes of centrifugation. A volume of 35ii\ was 
20 recovered. 

An adapter was prepared by annealing two oligonucleotides, 
a 24mer (GsCsAsGsGAGACATCGAAGGTATGAAC, where 's' represents 
a phosphorothioate bond) and a 12mer (TTCATACCTTCG): 

7.6|al 197pmol/p,l 24mer 
25 9.2!al 162pmol/jil 12mer 

1.87 ^.1 10x T4 DNA iigase buffer 

18.7^! 

The mixture was heated to 55^C and allowed to cool to 10^C 
over one hour. 

30 The adapter was ligated to the terminated genomic 
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fragments: 

35|il DNA 
18.7}xl adapter 

4.3^1 10x T4 DNA ligase buffer 
5 1 .5^1 1 0u/ul T4 DNA ligase 

M^idHaO 
62^l 

The reaction was incubated at 16°C over night, then heat 
inactivated at 70° C for 20 minutes. 
10 The DNA was separated from low molecular weight solutes 

by microconcentration (Microcon-30; Amicon) with successive additions of 
dHaO between episodes of centrifugation. A volume of 54p,l was 
recovered. 

To prevent generation of spurious products through priming 
15 from sites of single strand nicks, these were terminated by incubation with 
Thermo Sequenase: 

54^il DNA 

4.4|xl Themrio Sequenase buffer 

^A\x\ lOmM ddATP 
20 1.4|al 10mM ddCTP 

1.4nl 10mM ddGTP 

1.4^1 10mM ddTTP 

0.5JJ.I 32u/}il Themno Sequenase 

S^^ldHaO 
25 70^1 

The mixture was overlaid with mineral oil and incubated at 
74°C for 2 hours. 

The DNA was extracted (GFX purification column) and eluted 

in 50|xl 5mM Tris pH 8.5. 
30 (b) 5^g canine genomic DNA were fragmented with Mbo I, the 
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digestion proceeding to completion at 37°C: 
AA[i\ 1.135p.g/tJ.I genomic DNA 
10jj,l lOx restriction buffer 
2.5|il lOu/^il Mbo 1 
5 SSjaidHaO 
100^1 

Digestion was confirmed by electroplioresis of an aliquot of 
the reaction on a 1% agarose gel stained with ethidium bromide. 

Following incubation at 7Q'='C for 20 minutes the DNA was 
10 separated from low molecular weight solutes by microconcentration 
(Microcon-30; Amicon) with successive additions of dH20 between 
episodes of centrifugation. A volume of 32)i! was recovered of which haif 
was iigated to an adapter: 

An adapter was prepared by annealing two oligonucleotides, 
15 a 24mer (GsCsAsGsGAGACATCGAAGGTATGAAC, where 's' represents 
a phosphorothioate bond) and a 16mer (GATCGTTCATACCTTC): 
6.3jil 197pmol/^l 24mer 
S.Sial 147pmol/|al 16mer 
13^\i 10x T4 DNA ligase buffer 
20 16.5|al 

The mixture was heated to 55°C and allowed to cool to 10°C 
over one hour. 

The adapter was Iigated to the genomic fragments: 
16^1 DNA 
25 16.5(j,l adapter 

2.4|il 10x T4 DNA ligase buffer 
2^1 10ul/|alT4 DNA ligase 
SjlMidHsO 
40^l 

30 The reaction was incubated at 16"=*C over night, then heat 
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inactivated at 70°C for 20 minutes. 

The DNA was separated from low molecuiar weight solutes 
by microconcentration (Microcon-30; Amicon) with successive additions of 
dH20 between episodes of centrifugation. A volume of 40jj,l was 
5 recovered. 

The adapter-ligated fragments were terminated using Thermo 

Sequenase: 

40^l DNA 

4,4|j-l Thermo Sequenase buffer 
10 1.4|al lOmMddGTP 

0.5jj,l 32u/ul Thermo Sequenase 

24^1 dHaO 

70^1 

The reaction was overlaid with mineral oil and incubated at 
15 74''C for 1 hour. To prevent generation of spurious products through 

priming from sites of single strand nicks, these were terminated by further 
incubation with Thermo Sequenase and addition of the remaining ddNTPs: 
r4)Lil 10mM ddATP 
1.4|ii 10mM ddCTP 
20 1.4iLii 10mM ddTTP 

0.3jalThermo Sequenase buffer 
4.8fil 

The reaction was incubated at 74°C for a further hour. 

The DNA was extracted (GFX purification column) and eiuted 
25 in 50[i\ 5mM Tris pH 8.5, 

Methods (a) and (b) of adapter ligation and termination of the 
genomic fragments were compared by amplification of the resulting 
fragments with or without an 'internar primer in reactions comprising the 
following: 



30 
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5^1 


5^1 


5^1 


1 0x Taa PGR buffer 






5|il 


lOxdNTPs 


VI 


VI 


VI 


25pmol/|j.l 24mer 




VI 


Oiil 


50pmol/^l (AC)11B 


50ng 


Ong 


50ng 


GFX extracted DNA 




to 50^il 


dHaO 



Each reaction was overlaid witti mineral oil and heated to 
95°C for 2 minutes. 

0.5\xl of 5u/^l Taq DNA polymerase was added to each 
10 reaction, which was amplified for 25 repetitions of 95°C for 30 seconds, 
65°C for 30 seconds, 72°C for 1 minute, followed by a final extension of 
72X for 5 minutes. 

7.5p,l of each reaction was subjected to electrophoresis on a 
1.5% agarose gel stained with ethidium bromide. The negative control 
15 reactions that lacked DNA generated no product, while those reactions 
containing all components generated a smear of products of various 
molecular weights. In contrast, the reactions containing DNA but no 
internal primer were incapable of generating product. These results 
confirmed that adapters had been ligated successfully to genomic 
20 fragments and all 3' ends capable of extension in the presence of a DNA 
polymerase had been terminated. The preferred method was termination 
prior to ligation since (i) this guaranteed that all fragments successfully 
ligating were terminated and (ii) the opportunities for inter-fragment ligation 
were remote. 

25 

Amplification of 5' and 3' flanking sequences from terminated, 
adapter-ligated genomic fragments. 

Amplification reactions were performed for each VNTR primer 
containing the following: 



30 
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5[il 5^! 1 0x Taq PGR buffer 

5^1 5^1 lOxdNTPs 

2^l 2^1 25pmol/^l 24mer 

2ii\ 2|al 25pmol/^l (AC) 11 B or (CA) 1 1 D or (GT) 1 1 H or 



(TG)IIV 



2(al 0|j.l fragmented, terminated, adapter-ligated 
genome (approx. 50ng/ial) 

34!ii 36|ii dHaO 
50|al 50^1 

10 In addition, a parallel reaction was prepared containing all 

components except a VNTR primer. 

All reactions were overlaid with mineral oil and heated to 
95°C for 2 minutes. 0.5|liI of 5u/^I Taq DNA polymerase was added to each 
tube and amplification was achieved by thermal cycling for 18 repetitions of 

15 95°C for 30 seconds, 65°C for 45 seconds, 72°C for 45 seconds, followed 
by a final extension of 5 minutes at 72'*C. 

5|LLi of each reaction was loaded onto a 1.5% agarose gel 
stained with ethidium bromide, along with a molecular weight marker. The 
reactions that contained all components generated a smear of products of 

20 ranging from approximately 100 to 500bp, the intensity and distribution of 
molecular weights being comparable for each reaction. The lanes 
corresponding to those reactions lacking DNA and the reaction lacking a 
VNTR primer did not contain any product of amplification. 



25 Example 3 

The efficiency of digestion of the repeat sequence from a VNTR 
primed PGR product by T4 DNA polymerase was assessed. 

A cloned VNTR allele was amplified by Taq DNA polymerase 
and separated from low molecular weight solutes by microconcentration 
30 (Microcon-30; Amicon) with successive additions of dH20 between 
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episodes of centrifugation. A volume of 40|al was recovered, the 
concentration of which was judged by agarose gel electrophoresis to be 
130ng/|al, approximating to 1.3pmo[/fil. 

A LSu/jal dilution of T4 DNA polymerase was prepared with 
5 dHsO. The amplified DNA was digested at a concentration of 0,3pmoi/^l 
with varying concentrations of T4 DNA polymerase at 12^C: 

1 .Sfil 1 0x T4 DNA polymerase buffer 

0.75|al 10mM dATP 

0.75|al lOmMdCTP 
10 3.5^1 DNA 

0, 0.5, 1 . 2. or 4^1 1 .5u/fil T4 DNA polymerase 

to 15^1 dHsO 

Parallel reactions were prepared that lacked dNTPs. 
The reactions were incubated at 12°C for 1 hour, followed by heat 
15 inactivation at 70°C for 20 minutes. 

7.5|j,l of each reaction were subjected to electrophoresis on a 
2.5% agarose gel stained with ethidium bromide. In the absence of dNTPs 
all DNA was digested with enzyme concentrations exceeding 0.05u/|j,l. By 
contrast, there was no discernible loss of DNA in the presence of dNTPs at 
20 any concentration of T4 DNA polymerase. 

The efficiency of digestion of the repeat sequence from a VNTR 
primed PGR product by T7 gene 6 exonuclease was assessed, 

A cloned VNTR allele was amplified with the plasmid specific 
25 sense primer and the (GT)1 1 H primer by Taq DNA polymerase in the 

presence of [a-33P] dATP. Parallel reactions were performed for primers 
that contained or lacked a succession of four phosphorothioate bonds. In 
the primer pair containing phosphorothioate bonds these where located at 
the 5' end of the plasmid specific primer and at the 3^ end of the {GT)11H 
30 primer. 
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The amplified DNA was separated from low molecular weight 
solutes by microconcentration (Microcon-30; Amicon) with successive 
additions of dH20 between episodes of centrifugation. Equal amounts of 
the amplification reactions were digested by T7 gene 6 exonuclease at 
5 37°C for 15 and 30 minutes, the concentration of DNA approximating to 
0.1pmol/ul: 

3.6|al DNA 

2|al 5x T7 gene 6 exonuclease buffer 
1 1ll 1 0u/)il T7 gene 6 exonuclease 
10 SApii dH20 

10^1 

A control reaction was incubated for 15 minutes at 37°C in the 
absence of enzyme. 

All reactions were denatured at 95°C for 2 minutes with 
15 addition of 5jul formamide loading dye. 1 0|il of each sample was subjected 
to electrophoresis on an 8% polyacrylamide denaturing gel. An 
autoradiography film (Biomax MR; Kodak) was exposed to the gel after it 
had been fixed and dried. 

It was found that after 15 minutes of incubation the DNA that 
20 lacked phosphorothioate protection had been digested completely. By 

contrast, the presence of phosphorothioate bonds preserved the DNA, one 
strand in each molecule becoming shortened by digestion of the enzyme, 
although some non-specific loss of DNA was seen. 



25 The efficiency and specificity of digestion by T4 endonuciease VII and 
S1 nuclease was compared. 

Cloned VNTR alleles of the same VNTR that differed in their 
repeat lengths by 4 nucleotides were amplified separately in the presence 
of [a-33P] dATP. The products derived from the shorter allele were divided 
30 equally between two tubes. To one tube an equal amount of the longer 
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allele was added and the mixture was hybridised by denaturing at 98°C for 
2 minutes and annealing at 75°C for 150 minutes in lOOmM NaCI and 
200iaM CTAB. 

The hybridised and non-hybridised pools of DNA were 

5 separated from other low molecular weight solutes by microconcentration 
(Microcon-30; Amicon) with successive additions of dHaO between 
episodes of centrifugation. 

T4 endonuclease VII was diluted to 250u/!al in the supplied 
dilution buffer. Dilutions of SI nuclease were prepared in dHaO. Equal 

!0 amounts of either hybridised DNA or non-hybridised DNA were digested by 
50u/)al T4 endonuclease VII in Taq DNA polymerase buffer or by various 
concentrations of S1 nuclease in the supplied buffer. The S1 nuclease was 
added to the reactions to give final concentrations of 0.01u/|j,l, 0.03u/^l, 
0.1u/^il, and 0.3u/fj.l. In each case a control reaction that lacked enzyme 

15 was prepared. The reactions were performed at 37°C for 30 minutes. 

On completion of digestion the reactions were stopped by 
addition of EDTA and heat inactivation. An amount of formamide loading 
dye equal to half the reaction volume was added and each reaction was 
denatured by incubation at 95°C for 5 minutes. 12 |il of each sample were 

20 subjected to electrophoresis on an 8% polyacrylamide denaturing gel. An 
autoradiography film (Biomax MR; Kodak) was exposed to the fixed and 
dried gel. 

T4 endonuclease VII was found to cleave about half of all 
DNA derived from hybridisation of approximately equal amounts of two 

25 different alleles of the same VNTR, creating a characteristic pattern of 
cleaved products corresponding to the position of the mis-match within the 
repeat sequence at the time of cleavage. The DNA derived from the single 
allele that had not been hybridised and, therefore, comprised mis-match 
free double stranded DNA was not affected by T4 endonuclease VII. In 

30 contrast, the characteristic pattern of cleaved products that was seen with 
T4 endonuclease VII was not seen in association with SI nuclease under 
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any of the reaction conditions. As such, T4 endonuciease VI! was 
considered the better of the two enzymes in this application. 

Repetition of the T4 endonuciease VII reactions using various 
concentrations of enzyme for 30 minutes and 1 hour of digestion in 1x Taq 

5 PGR buffer, 1x Pfu buffer (Stratagene) and 1x T7 gene 6 exonuclease 
buffer confirmed that the enzyme digested predictably and reproducibly 
over a range of reaction conditions, their being no overt non specific 
digestion of DNA detectable at concentrations up to 200u/ial. The enzyme 
was found to cleave hybridised molecules containing mis-matches of a 

10 range of sizes. 

The characteristic pattern of cleaved products resulting from a 
mis-match within a repeat sequence was seen with S1 nuclease only when 
large amounts of DNA were loaded onto a poiyacrylamide gel. This was 
seen with a four nucleotide mis-match. The ability of S1 nuclease to 

15 resolve a two nucleotide mis-match was found to be poor 

The effect of enzyme concentration on the efficiency of cleavage of 
mis-match containing duplex DNA by T4 endonuciease VII was 
assessed. 

20 Two cloned VNTR alleles that differed in allele length by 2 

nucleotides were amplified separately using the plasmid specific primers, 
one of which had been labelled with [y-33P] ATP using T4 polynucleotide 
kinase. Each amplified allele was separated from low molecular weight 
solutes by microconcentration (Microcon-30; Amicon) with successive 

25 additions of dH20 between episodes of centrifugation. 

Half of the DNA derived from amplification of the smaller 
allele was saved. To the remaining half was added approximately an equal 
amount of amplified DNA of the larger allele. This mixture was denatured 
at 98°C for 2 minutes and then annealed at 75°C for 2 hours in the 

30 presence of lOOmM NaCl and 200^M CTAB, the transition between 

temperatures occurring rapidly. Separation of the annealed DNA from low 
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molecular weight solutes by microconcentration was repeated. 

Serial dilutions of T4 endonuciease VII were prepared in the 
supplied dilution buffer The non-denatured smaller allele and the allele 
mixture that had been denatured and annealed were each digested in Taq 
5 DNA polymerase buffer with T4 endonuciease VI! at final concentrations of 
Ou/|al, 50u/^l, 100u/)al and 150u/^l: 
Bjal DNA 

1^! 1 0x Taq PGR buffer 

3[xi T4 endonuciease VII 
10 10^1 

Incubation at 37°C was carried out for 30 minutes, after which 
each reaction was heated to 95°C for 2 minutes with addition of 5p.l 
formamide loading dye. 10}xl volumes were subjected to electrophoresis 
on an 8% poiyacrytamide denaturing gei, after which the gel was fixed, 
15 dried and exposed to an autoradiography film (Biomax MR; Kodak). 

Almost no digestion of the non-denatured smaller allele was 
detected. The little that was seen was assumed to have occurred as a 
result of digestion at sites of polymerase error or the annealing of stutter 
bands during the final cycle of amplification. In the lanes corresponding to 
20 the annealed allele mixture the characteristic pattern of digestion was seen 
to occur in the presence of T4 endonuciease VII. Although the amount of 
digestion at 100u/|al appeared to be slightly greater than at 50u/^l, the 
degree of digestion at each enzyme concentration was found to be almost 
uniform. 

25 Similar experiments were performed using various 

concentrations of T4 endonuciease VII in Pfu buffer (Stratagene) and T7 
gene 6 exonuclease buffer. Efficient digestion of mis-match containing 
DNA was found to occur in both reaction buffers, the degree of digestion 
maximising at concentrations of T4 endonuciease VII between 50u/|j,l and 

30 100u/|ii. Duplex DNA lacking a mis-match was resistant to T4 
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endonuclease VII under these conditions. 

The efficiency and specificity of S1 nuclease digestion in T7 gene 6 
exonuclease buffer was assessed* 

5 A cloned VNTR allele was amplified with the piasmid specific 

prinners, one of which had been labelled with [y-33P] ATP using T4 
polynucleotide kinase. The amplified product was separated from low 
molecular weight solutes by microconcentration (Microcon-30; Amicon) with 
successive additions of dH20 between episodes of centrifugation. The 

10 volume of recovered DNA was divided: SOjal was preserved as double 

stranded DNA while the remaining 30|al DNA was rendered single stranded 
by denaturation at QS^'C for 2 minutes followed by snap cooling on iced 
water. 

Dilutions of S1 nuclease were prepared in dH20. Equal 
15 amounts of double stranded DNA or single stranded DNA were digested in 
T7 gene 6 exonuclease buffer at SZ^'C for 5 minutes in the presence of S1 
nuclease at final concentrations of Ou/^1, 0.1u/)al, 0.3u/|ai, ^u/^xl and 3u/|aL 
On completion of digestion the reactions were stopped by addition of 
500mM EDTA pH8 to a final concentration of 25mM. 
20 The reactions were denatured by addition of formamide 

loading dye and heating to 95''C for 3 minutes, after which aliquots were 
subjected to electrophoresis on an 8% polyacrylamide denaturing gel. The 
gel was fixed, dried, and exposed to an autoradiography film (Biomax MR; 
Kodak). 

25 It was found that a concentration of 1 u/p.! S1 nuclease in T7 

gene 6 exonuclease buffer produced optimal digestion of single stranded 
DNA, there being no overt loss of double stranded DNA at this 
concentration. 
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Assessment of the digestion of DNA by T7 gene 6 exonuclease in 
concert with S1 nuclease. 

For assessment of T7 gene 6 exonuclease and S1 nuclease, 
DNA was amplified from a cloned VNTR allele using the plasmid specific 

5 sense primer with four phosphorothioate bonds at the 5' end and either the 
(AC)1 1B primer containing four phosphorothioate bonds at the 3' end or the 
(AC)1 1 B primer that lacked such bonds. The amplified products were 
separated from low molecular weight solutes by microconcentration 
(Microcon-30; Amicon) with successive additions of dHaO between 

10 episodes of centrifugation. The volumes recovered in each case were 
measured to be 40^1. These were found to contain approximately 
1.3pmol/|al and 0.35pmol/|j,l for the reactions primed by the VNTR primer 
with and without phosphorothioate bonds, respectively. 

T7 gene 6 exonuclease was diluted to 10u/|j.l in dH20. 

15 S1 nuclease was diluted to 10u/jal in dHaO. 

Each amplified product, at a concentration of approximately 
0.1pmol/|xl, was digested by T7 gene 6 exonuclease. In addition, the DNA 
generated with the (AC)11B primer containing phosphorothioate bonds was 
digested by T7 gene 6 exonuclease in concert with S1 nuclease: 

20 

without PT bonds with PT bonds with PT bonds 

4fil 4jii 4|il 5x T7 gene 6 buffer 

5.7^1 1.6^1 1.6mI DNA 

0, 2, 4, 8^ll 0, 2, 4, 8fil 0. 2, 4, 8\x\ Wuf\i\ T7 gene 6 exonuclease 

25 0|ii O^it 2\i\ 10u/^! S1 nuclease 

to 20^1 to20|al to20^i dH20 

Each reaction was incubated at 37''C for 10 minutes, after 
which 1)11 500mM EDTA pH8 was added to each tube followed by 
incubation at 70''C for 20 minutes. 
30 1 0al of each digest was subjected to electrophoresis on a 

2.5% agarose gel stained with ethidium bromide. Lanes corresponding to 
reactions lacking enzyme contained a discrete band of the expected 
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molecular weight. The appearance of a lower molecular weight band, 
corresponding to single stranded DNA, was seen at a concentration of 
1u/p.\ T7 gene 6 exonuclease for DNA primed by the (AC)11B primer that 
lacked phosphorothioate protection. At concentrations exceeding this 
5 virtually all DNA was single stranded. In contrast, DNA protected by 

phosphorothioate bonds at each end did not appear to alter significantly in 
molecular weight at any of the concentrations of T7 gene 6 exonuclease, 
but a decrease in the amount of DNA was evident with increasing 
concentrations. Similarly, DNA protected at each end was resistant to 
10 digestion of T7 gene 6 exonuclease in combination with S1 nuclease. 
Concentrations of 1u/)il T7 gene 6 exonuclease with 1u/^l S1 nuclease in 
T7 gene 6 exonuclease buffer containing approximately 0.1 pmol/|il DNA 
appeared to give the best results. 

15 The mis-match discrimination procedure was assessed using a model 
system comprising tliree alleies of tiie same VNTR In concert with a 
single allele of a second VNTR. 

A mixture of VNTR alleles was prepared that contained three 
alleies of the same VNTR. (AC)10, (AC)11, and (AC)18, in a 2 : 1 : 1 ratio 

20 respectively, in addition, an amount of the {CA)16 allele of a second 
VNTR, equal to that of the (AC)1 1 and (AC)1 8 alleles, was added to the 
mixture. Using Pfu DNA polymerase (Stratagene) 1ng of the mixture was 
amplified by PGR in a reaction volume of IOO11I containing 60 pmoles of 
each plasmid specific primer, the sense primer having been labelled with 

25 [y-33P] ATP. Thermal cycling was performed for 17 repetitions of 95°C for 
30s, 65°C for 30s, 72°C for 45s, followed by a final extension of 72°C for 5 
minutes. 

The amplified DNA was separated from low molecular weight 
solutes by microconcentration (Microcon-30; Amicon) with addition of dHzO 
30 between episodes of centrifugation. The recovered DNA was denatured at 
98°C for 2 minutes and then annealed at 75''C for 2 hours in lOOmM NaCI 
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and 200jiM CTAB, the transition between temperatures being rapid. 

The hybridised DNA was separated from low molecular 
weight solutes by microconcentration (Microcon-30; Amicon) with addition 
of dH20 between episodes of centrifugation, and digested by T4 
5 endonuclease VII in Taq DNA polymerase buffer containing 50u/^l1 of the 
enzyme in a total volume of 3Q\x\. Digestion proceeded at ST^'C for 1 hour 
after which the reaction was incubated at 75°C for 15 minutes. 

The digested DNA was separated from low molecular weight 
solutes by microconcentration (Microcon-30; Amicon) with addition of dH20 
10 between episodes of centrifugation. Further digestion was performed in a 
50|al reaction containing 1u/p.l T7 gene 6 exonuclease and 1u/|a,l S1 
nuclease in T7 gene 6 exonuclease buffer at ST'^C for 10 minutes. The 
reaction was stopped by addition of 2^1 500mM EDTA pH8 and heating to 
75°C for 10 minutes. 
15 Microconcentration was performed (Microcon-30; Amicon) 

with addition of dHaO between episodes of centrifugation. A volume of 48|al 
was recovered of which 4jj,l was amplified by PGR, as before. This was 
followed by a second round of the mis-match discrimination procedure. 

Aliquots of the amplified DNA before and after each round of 
20 the mis-match discrimination procedure were subjected to electrophoresis 
on an 8% poiyacryiamide denaturing get. In addition, for comparison of the 
molecular weight of each product, the PGR products of each allele 
amplified in isolation were loaded onto the gel. 

It was found that Pfu generated numerous stutter bands in 
25 each amplification reaction. The amount of the (AG)10 allele in the mixture 
prior to mis-match discrimination was approximately twice that of all other 
alleles. These others were present in approximately equal amounts. After 
the first round of mis-match discrimination obvious enrichment of the 
(AC) 10 allele was seen. This was enhanced by the second round of mis- 
30 match discrimination giving rise to a very strong band corresponding to the 
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(AC)10 allele and marked reduction of the {AC)11 and (AC)18 alleles. 
Although a band corresponding to the (CA)15 alieie of the second VNTR 
was present after the second round of mis-match discrimination it was not 
as bright as that of the enriched (AC) 10 allele. This was considered to 
5 reflect the inequality in the total DNA of each VNTR within the mixture and 
the consequential relative inefficiency of hybridisation following second 
order kinetics. This experiment confirmed that mis-match discrimination 
enriches the allele in a mixture of alleles of the same VNTR that has the 
highest frequency. 

10 

Example 4 

The protocol was assessed using the pooled genomes of several 
dogs. 

In the absence of DNA samples from individuals affected and 
15 unaffected by a hereditary trait the protocol was validated on a model 

system designed to mimic a scenario of VNTR linkage disequilibrium that 
would be expected in the presence of a recessive trait. 

A total of 43 dogs were genotyped with respect a VNTR 
previously isolated in the dog using VNTR specific primers. The VNTR 
20 primer pair comprised (CACTTGGGACTTTGGATTGGTCA) sense primer 
and (GTCTTTGTTTCCATTCTTGCTTGC) antisense primer. 

Amplification reactions by PGR were performed in a volume 
of 10|al containing 20ng genomic DNA and 4pmoles of each VNTR specific 
primer. In each case the VNTR specific sense primer was labelled and 
25 added to an amplification reaction master mix: 

1 .5|i! 10x T4 polynucleotide kinase buffer 

2A\x\ 50pmol/|il VNTR specific sense primer 

4.5^1 [y-33]ATP 

1 |il 1 in 3 dilution of 30 u/jj.1 T4 polynucleotide kinase 
30 5.6^1 dHsO 

15|Lll 
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The reaction was incubated at 37°C for 1 hour, then 90°C for 

5 minutes. 

The T4 polynucieotide kinase reaction was added to a PGR 

master mix: 

5 15|Lil T4 polynucleotide kinase reaction 

45|Lil 1 0x Taq DNA polymerase buffer 
45^! lOxdNTPs 

2.4^1 50pmol/|al VNTR specific antisense primer 
4.5|i,l 5u/|il Taq DNA polymerase 
iO 293^1 dH20 

405^1 

For each dog 1 |j,l of 20ng/|al genomic DNA was added to 9\x\ 
of PGR master mix which was overlaid with mineral oiL Each reaction was 
placed onto a preheated thermal cycler at OS'^G and incubated for 2 

15 minutes. Thermal cycling then followed with 28 repetitions of denaturation 
at 95°G for 30s, annealing at 65°C for 30s, and extension at 72°C for 30s, 
followed by a final extension of 72°G for 5 minutes. 

On completion of thermal cycling Sjii of formamide loading 
dye was added to each reaction with denaturation at 90°C for 3 minutes 

20 prior to electrophoresis at 60W on an 8% polyacrylamide denaturing gel. 
The gel was fixed in 10% methanol/10% glacial acetic acid and dried. An 
autoradiography film (BioMax MR; Kodak) was exposed to the gel 
overnight. 

The genotype of each dog was scored with respect to the 
25 VNTR. Ten dogs were selected to represent the 'affected poor of 

individuals and ten were selected to represent the 'wild type poor. This 
selection was made in order to achieve a scenario that may mimic a 
recessive trait: 
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10 



15 



20 



Affected 

(AC)n 

(AC)n+1 

(AC)n+2 

(AC)n+3 

(AC)n+4 

(AC)n+5 

(AC)n+6 

(AC)n+7 

Wild ty pe 

(AC)n 

(AC)n+1 

(AC)n+2 

(AC)n+3 

(AC)n+4 

{AC)n+5 

(AC)n+6 

(AC)n+7 



Allele frequency 

100% 

0% 

0% 

0% 

0% 

0% 

0% 

0% 

Allele frequency 

15% 

0% 

0% 

0% 

35% 

20% 

0% 

30% 



Ampiimers were prepared from genomic DNA of a single dog. 
in a lOOjal volume 5|ag of genomic DNA were digested by 20 units Hae 111, 
the digestion proceeding to completion over 12 hours at 37°C: 
4.4^,1 1 .1 35|ig/ial genomic DNA 
25 10^1 1 0x restriction buffer 

2^1 10u/nlHaelll 
Ml^i CIH2O 
100^1 

Digestion was confirmed by electrophoresis of an aliquot of 
30 the reaction on a 1% agarose gel stained with ethidium bromide. 

The DNA was extracted (GFX purification column) and eluted 
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in 50^1 5mM Tris pH8.5, of which approximately 3\xg contained within 30ial 
was incubated with Terminal deoxynucleotidy! transferase for 3 hours at 
37°C : 

30^1 DNA 

5 30fit 5x Terminal deoxynucieotidyl transferase buffer 

4.5^1 lOmMddGTP 

1 0jil 9ul\x\ Terminal deoxynucieotidyl transferase 

75.5|al dH^O 

150|il 

10 The DNA was separated from low molecular weight solutes 

by microconcentration (Microcon-30; Amicon) with successive additions of 
dH20 between episodes of centrifugation. A volume of 35ixi was 
recovered. 

An adapter was prepared by annealing two oligonucleotides, 
15 a 24mer (GsCsAsGsGAGACATCGAAGGTATGAAC, where 's' represents 
a phosphorothioate bond) and a 12mer (TTCATACCTTCG): 

7.6|a! 1 97pmol/|j.l 24mer 

9.2^1 162pmol/)il 12mer 

1.87 jal 1Qx T4 DNA ligase buffer 
20 18.7|al 

The mixture was heated to 55°C and allowed to cool to 10°C 
over one hour. 

The adapter was ligated to the terminated genomic 

fragments: 
25 35^1 DNA 

18.7|al adapter 

4.3|al 10xT4 DNA ligase buffer 
1,5^1 10u/^lT4 DNA ligase 
2.5)il dHzO 
30 62iai 
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The reaction was incubated at 16°C over night, then heat 
inactivated at 70°C for 20 minutes. 

The DNA was separated from low molecular weight solutes 
by microconcentration (Microcon-30; Amicon) with successive additions of 
5 dH20 between episodes of centrifugation. A volume of 54\i\ was 
recovered. 

To prevent generation of spurious products through priming 
from sites of single strand nicks, these were terminated by incubation with 
Thermo Sequenase: 



10 


54^1 


DNA 




4A[i\ 


Thermo Sequenase buffer 




1.4^1 


10mM ddATP 




1.4^1 


10mM ddCTP 




1.4iil 


10mM ddGTP 


15 


1.4^1 


10mM ddTTP 




0.5^1 


32u/[j.l Thermo Sequenase 




5Jni 


dHzO 




70nl 





The mixture was overlaid with mineral oil and incubated at 
20 74*^0 for 2 hours. 

The DNA was extracted (GFX purification column) and eluted 
in 50|a! 5mM Tris pH 8.5. 

Amplimers were prepared from this DNA using VNTR primers 
and the 24mer oligonucleotide contained within the adapter as the adapter 
25 primer: 

5|iil 10x Taq DNA polymerase buffer 

5^1 lOxdNTPs 

2|il 25pmoI/|j.! adapter primer 

2^1 25pmol/!al VNTR primer [ (AC)1 1 B, (CA)1 1 D, (GT)1 1 H, 

30 or(TG)11V] 
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2^1 terminated, adapter-ligated DNA fragments (approx. 

50ng/|ai) 

M^i dHaO 
50^1 

5 Similar reactions were prepared containing a VNTR primer 

but in the absence of genomic DNA. in addition, a single reaction was 
performed containing genomic DNA but in the absence of a VNTR primer, 
All reactions were overlaid with mineral oil and incubated at 95°C for 2 
minutes. Addition of O.S^il of Su/jal Taq DNA polymerase was made to each 
10 reaction. Amplification was achieved by thermal cycling for 18 repetitions 
of 95^*0 for 30 s, 65°C for 45s, 72°C for 45s, followed by a final extension of 
72°C for 5 minutes. 

On completion of amplification 5|al of each reaction were 
subjected to electrophoresis with a molecular weight marker on a 1.5% 
15 agarose get stained with ethidium bromide. The presence of amplified 

products in the lanes representing reactions containing template DNA and 
a VNTR primer confirmed that ligation of the genomic fragments to adapter 
sequence had occurred. In each case the appearance of these lanes was 
similar, there being a smear of amplified products distributed over a range 
20 of molecular weights from approximately 1 0Obp to 500bp. All other lanes 
lacked product of amplification. The fact that the reaction containing 
template DNA but no VNTR primer did not generate product confirmed that 
the ail 3' ends had been terminated successfully such that chain extension 
in the presence of Taq DNA polymerase was prevented. 
25 The {AC)1 1 B and (CA)1 1 D primed reactions were combined. 

Also, the (GT)1 1 H and (TG)1 1 V primed reactions were combined. Both 
amplimer pools were separated from low molecular weight solutes by 
microconcentration (Microcon-30; Amicon) with addition of dH20 between 
episodes of centrifugation. Quantification by agarose gel electrophoresis of 
30 the recovered DNA suggested that each contained approximately 35ng/|j.l 
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amplimer DNA. 

The repeat sequences were removed from the pooled 
(AC)11B and {CA)1 ID primed products using T4 DNA polymerase and 
Exonuciease VII: 

5 14|ai 35ng/|ai (AC)11 B/(CA) 11 D primed amplimer DNA 

2\i\ 1 0x T4 DNA polymerase buffer 
VI lOmMdATP 
1^x1 lOmMdCTP 

2]x\ 1 in 4 dilution of 4u/jal T4 DNA polymerase 

10 20|j.l 

The reaction was incubated at 12°C for 1 hour then 
inactivated at 70°C for 20 minutes. 

To the reaction was added ^]x] of lOu/jal Exonuciease Vll with 
incubation at 37°C for 30 minutes followed by 70^*0 for 20 minutes. 

15 The designated affected and wild type DNA pools were 

prepared by combining equal amounts of genomic DNA, quantified by 
spectrophotometry, of the selected dogs. These were phenol/chloroform 
extracted and microconcentrated (Microcon; Amicon) with addition of dH20 
between episodes of centrifugation. 

20 Each pool of genomic DNA was digested by Hae 111, 

terminated using Terminal deoxynucleotidyl transferase, and ligated to the 
adapter in a manner similar to that previously described. Complete 
termination of all 3' ends was confirmed by PGR with the adapter primer. 
The DNA pools were quantified by agarose gel electrophoresis and were 

25 found to contain approximately equal concentrations. 

In a minima! volume 2.5m,1 of the 35ng/^l {AC)/(CA) primed 
amplimer pool, digested with T4 DNA polymerase and Exonuciease VII, 
were hybridised in 0.6M NaCl to approximately 300ng of the 'affected' 
genomic DNA pool that had been fragmented, terminated, and ligated to 

30 the adapter. This was achieved by denaturing the mixture under mineral oil 
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at 98°C for 3 minutes, followed by a stepwise reduction in the temperature 
from 80°C to 70°C over ten liours and sustaining the final temperature for a 
further 10 hours. The wild type pool was hybridised in a similar manner in 
parallel. 

5 To each hybridisation were added: 

20|al 10x Taq DNA polymerase buffer 
20^1 lOxdNTPs 
160^1 dHaO 
200^1 

10 In each case the total volume containing the hybridised DNA 

was divided between two reaction tubes. Under mineral oil each volume 
was heated to 75''C, ^\x\ of 5u/[4.l Taq DNA polymerase was added to each 
tube followed by incubation at 72°C for 10 minutes. 
The reactions were denatured at QS'^C for 3 minutes and 4|j.I of 25pmol/|i! 

15 adapter primer were added. Amplification of the hybridised DNA was 

achieved by thermal cycling for 30 repetitions of 95°C for 30s, BS'^C for 30s, 
72''C for 90s, followed by a final extension of 72°C for 5 minutes. 

The reactions containing affected DNA were pooled, as were 
the reactions containing wild type DNA, and 8|il of 10u/|ii Exonuclease I 

20 were added to each 200)j,l volume of amplified DNA. The reactions were 
incubated at 37^C for 15 minutes. 

For each reaction the DNA was separated from low molecular 
weight solutes (Microcon-30; Amicon) with addition of dH20 between 
episodes of centrifugation. In each case a volume of 1 0jil was recovered. 

25 The alleles contained within each sample were denatured and allowed to 
anneal by incubation under mineral oil at 98'='C for 5 minutes followed by a 
rapid reduction in temperature to 75°C. At 75°C 2M NaCI and lOmM CTAB 
were added to give final concentrations of 50mM and 500|iM, respectively. 
The hybridisation reactions were incubated at 75°C for a further 16 hours. 

30 To each hybridisation reaction was added 150ial of 5mM Tris 
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pH 8.5. The diluted hybridisation reactions were then separated from low 
nnoiecular weight solutes (Microcon-30; Amicon) with addition of dHaO 
between episodes of centrifugation. These were judged to contain 
approximately lOpmoles DNA. Digestion by T4 endonuciease VII at a 

5 concentration of 50u/|il in Taq DNA polymerase buffer was performed in a 
volume of 100]j,L The digestion proceeded at ST^'C for 30 minutes prior to 
incubation at 65°C for 15 minutes. 

Each digest was separated from low molecular weight solutes 
{Microcon-30; Amicon) with addition of dH20 between episodes of 

10 centrifugation. The recovered volume in each case was divided between 
three tubes, each being digested either by 0.5u/^l Exonuciease 1 in 1x Taq 
DNA polymerase buffer, 1u/jal T7 gene 6 exonuciease followed after heat 
inactivation at 70°C for 10 minutes by 0.5u/|il Exonuciease I in 1x T7 gene 
6 exonuciease buffer, or 1u/[il T7 gene 6 exonuciease together with lu/jjil 

15 S1 nuclease in 1x T7 gene 6 exonuciease buffer. The concentration of 

DNA in each reaction was approximately 0.1pmol/|Li! contained within a 30|al 
volume. The Exonuciease I reactions were performed at 37°C for 15 
minutes prior to heat inactivation at 70°C for 10 minutes. The reactions 
containing T7 gene 6 exonuciease with or without S1 nuclease were 

20 performed at 37°C for 10 minutes. On completion of each regime of 
digestion the DNA was extracted (GFX purification column) and eluted in 
50^1 dHsO. 

Three quarters of each of the extracted DNA samples was 
amplified by PGR with Taq DNA polymerase 
25 37.5|il digested DNA 

^5\x\ 10x Taq DNA polymerase buffer 

15ki1 lOxdNTPs 

6\i\ 25pmol/^l adapter primer 

76.5[il dHoO 
30 150)^1 
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The reactions were divided into 75ial atiquots and overlaid 
with mineral oil to which were added 0.75|al of Su/j^l Taq DNA polynnerase 
after incubation at 95°C for 2 nninutes. Amplification was achieved by 
thermal cycling for 25 repetitions of 95°C for 30s, 65^=^0 for 30s, 72X for 
5 90s, followed by a final extension of 72°C for 5 minutes. 

To each 150^1 of amplified DNA were added 6|il 10u/)il 
Exonuclease I. The reactions were incubated at 37°C for 15 minutes. 

The DNA in each case was separated from low molecular 
weight solutes (Microcon-30; Amicon) with addition of dH20 between 
10 episodes of centrifugation. Repetition of hybridisation in 50mM NaCI and 
SOO^iM CTAB followed by each regime of digestion was repeated, followed 
by amplification of the resulting DNA by PGR with Taq DNA polymerase, as 
above. 

Aliquots of each of the amplified samples were subjected to 

15 electrophoresis on a 1 .5% agarose gel stained with ethidium bromide with 
a molecular weight marker. The amplified products in the lanes 
corresponding to DNA digested by T4 endonuclease VII followed by 
Exonuclease I were of high molecular weight smearing towards the well. In 
contrast, the lanes corresponding to amplified product that had been 

20 /tJigested by either T7 gene 6 exonuclease followed by Exonuclease I or T7 
gene 6 exonuclease concomitantly with S1 nuclease contained products 
ranging in molecular weights from approximately 200bp to 750bp. The 
distribution of molecular weights in each case was simitar No smearing 
towards the well was seen suggesting that the spurious products of 

25 amplification that were seen in the absence of T7 gene 6 exonuclease were 
eliminated by the presence of this enzyme. As such, T7 gene 6 
exonuclease was considered an essential component of the mis-match 
discrimination regime for removal of repeat sequences from T4 
endonuclease Vll cleaved molecules that would otherwise cross-hybridise 

30 and produce spurious DNA molecules. 

To each of the 150fil volumes of amplified DNA resulting from 
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the second round of mis-match discrimination were add 6p.l of 10u/|al 
Exonuciease I and tlie reactions were digested at 37°C for 15 minutes. 

Tine DNA in eacli case was separated from low molecular 
weight solutes (Microcon-30; Amicon) with addition of dHaO between 

5 episodes of centrifugation. 

For each of the reactions corresponding to the 'affected' dogs 
amplification was performed by PGR with Taq DNA polymerase using the 
VNTR specific primers in a volume of 50|il containing approximately 25ng 
DNA. Amplification by 28 repetitions of thermal cycling was performed 

10 after which 5|j.l aliquots and a molecular weight marker were loaded onto a 
2% agarose gel stained with ethidium bromide. 

For the lanes corresponding to digestion by T4 endonuclease 
VII and Exonuciease I the product of the expected molecular weight was 
very faint. In addition a large amount of spurious product in the vicinity of 

15 the wells was seen. For all other lanes no high molecular weight products 
were seen. Furthermore, the amplified products were seen clearly as a 
discrete band of the expected molecular weights of approximately 130bp. 

The products of amplification corresponding to digestion by 
T4 endonuclease VII and Exonuciease I were discarded. The remaining 

20 reactions were amplified further using the VNTR specific primers, one of 
which was labelled with [y-33P] ATP using T4 polynucleotide kinase. 
Amplification reactions were performed by PGR using Taq DNA 
polymerase in volumes of 20^il containing lOpmoles of each primer for 35 
repetition of thermal cycling. In addition, reactions were performed in the 

25 same manner containing 40ng of the pooled 'affected' and pooled 'wild 
type' DNA. After addition of 10^1 of fomiamide loading dye to each sample 
the amplified products were denatured at 90°G for 3 minutes. 6|al aliquots 
of the mixture were subjected to electrophoresis on an 8% polyacrylamide 
denaturing gel. The gel was fixed and dried and exposed to an 

30 autoradiography film. 

it was found that product was visible for DNA amplified from 
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affected DNA following the second round of mis-match discrimination. This 
was seen in both the lanes corresponding to digestion by T7 gene 6 
exonuciease followed by Exonuclease I and those corresponding to 
digestion by T7 gene 6 exonuclease concomitantly with S1 nuclease. In 

5 each case the product resembled that resulting from amplification of the 
pooled affected DNA that had not been subjected to mis-match cleavage. 
In the case of wild type DNA amplified after the second round of mis-match 
discrimination no products were discernible. 

This experiment confirmed that VNTRs are reproduced with 

10 fidelity from the pooled genomes of several individuals, the alleles in each 
case being preserved, and mis-match discrimination serves to eliminate 
spurious products of amplification and enrich the VNTR allele of the highest 
frequency. Although no products were visible for DNA derived from the 
wild type DNA, it may be that products would become visible with higher 

15 loading of DNA on the poiyacrylamide gel. As such, further repetition of the 
mis-match discrimination procedure would be necessary to reduce to near 
homozygosity the alleles in both DNA pools such that final selection of the 
informative allele could be achieved. 

20 Example 5 

Demonstration of the resistance to Exonuclease Hi of DNA with a 3' 
overhang derived by ligation to an adapter. 

A cloned VNTR allele was amplified by Taq DNA polymerase. 
The amplified DNA was separated from low molecular weight solutes by 
25 microconcentration (Microcon-30; Amicon) with successive additions of 
dHaO between episodes of centrifugation. 

The volume recovered was measured at 44|al, the 
concentration of which was determined by agarose gel electrophoresis to 
be 160ng/^l, approximating to 1,6pmol/}il. 

30 The amplified DNA was blunted by T4 DNA polymerase 

digestion: 
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42^il DNA 
3.25(al 10mM dATP 
3.25^11 OmM dCTP 
3.25)^1 lOmMdGTP 
5 3.25^1 lOmMdTTP 

1 3ii\ 1 0x T4 DNA polymerase buffer 
3.25|il 4u/|il T4 DNA polymerase 
59^1 dHsO 
130)11 

10 The reaction was incubated at 12°C for 30 minutes, then heat 

inactivated at 70°C for 20 minutes. The DNA was separated from iow 
molecular weight solutes by microconcentration (Microcon-30; Amicon) with 
successive additions of dHaO between episodes of centrifugation. A 
volume of 30|j,t was recovered. 
15 1 600pmoles of a 21 mer oligonucleotide 

(CTCGCAAGGATGGGATGCTCG) were phosphorylated with T4 
polynucleotide -kinase diluted to 10u/|j,l in the supplied dilution buffer: 
3.19|j,l 21 mer oligonucleotide 
1 .5^tl 1 0x T4 DNA ligase buffer 
20 1|al 10u/|al T4 polynucleotide kinase 

9^ii\ dHaO 
15^1 

The reaction was incubated at 37°C for 30 minutes, then heat 
inactivated at 9Q°C for 10 minutes. 
25 To the kinase reaction was added 1600pmoles of a 12mer oligonucleotide 
(CATCCTTGCGAG). Annealing of the oligos to form an adapter was 
achieved by heating to 55°C and allowing the mixture to cool to 10°C over 
a period of 1 hour. 

Half of the DNA blunted by T4 DNA polymerase was saved. 
30 To the annealed adapter was added the remaining ^5\x\ of blunted DNA 
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such that the adapter was in a 50 fold excess: 
15jii blunted DNA 
16.2|j,l annealed adapter 
1.9^1 lOx T4 DNA ligase buffer 
5 l^ii 1 0u/fil T4 DNA ligase 

34^1 

The ligation reaction was incubated over night at 16°C. 
The ligation was heat inactivated at 70°C for 20 minutes and 
the DNA was separated from low nnolecular weight solutes by 
10 microconcentration (Microcon-30; Amicon) with successive additions of 
dH20 between episodes of centrifugation. 

The volume recovered was measured to be 36|j.l. 
The ligated DNA and 15|il of non-iigated DNA that had been 
saved were both made to approximately 0.75pmo[es/|al by addition of 
15 dH20. Each was digested by Exonuclease 111 at a final concentration of 
DNA approximating to 0.2pmol/fii: 
10.7^1 DNA 

4|al lOx Exonuclease 111 buffer 
200u/^l Exonuclease III 
20 24.3|al dH90 

40^1 

The reaction was incubated 37°C for 5 minutes then heat 
inactivated at 70*'C for 20 minutes. 

Approximately 2pmoles of each digest were loaded onto a 2% 
25 agarose gel stained with ethidium bromide. All non-ligated DNA was 

digested to completion by Exonuclease 111 such that none was detectable 
on the agarose gei. In contrast, although some digestion had occurred, 
much of the ligated DNA was found to be resistant to digestion. That which 
had been digested was assumed to have failed to ligate to the 
30 phosphorylated adapter. This experiment confirmed that ligation of an 
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adapter is one method by which DNA molecules may become resistant to 
Exonuclease 111 digestion, those molecules lacking an adapter being 
digested to completion by this enzyme. 

5 Selection of unique sequences in a pool of DNA hybridised to a 
second pool of DNA using Exonuclease III. 

Two cloned VNTR alleles that differed in their repeat lengths 
by four nucleotides were amplified by PGR using Taq DNA polymerase. 
The amplified DMAs were separated from low molecular weight solutes by 
10 microconcentration (Microcon-30; Amicon) with successive additions of 
dHaO between episodes of centrifugation and the resulting concentrations 
of DNA were determined by agarose gel electrophoresis. 

To a portion of the amplified products of the smaller allele 
was added a 3' overhang by incubation with Terminal deoxynucleotidyl 
15 transferase: 

12.5ial 120ng/[xl DNA (approx. 1.2pmoi/[il) 

1 5|il 5x Terminal deoxynucleotidyl transferase buffer 

1.125|a! lOmMdATP 

3.3p.l 9u/jal Terminal deoxynucleotidyl transferase 
20 43^1 dHaO 

75fxl 

The reaction was incubated at 37°C for 1 hour after which the 
DNA was extracted (GFX purification column). 

To 450ng of the allele possessing a 3' overhang was added: 
25 (i) 4.5!ag of the same allele that lacked a 3' overhang; 

(ii) 4.5iag of the larger allele that lacked a 3' overhang. 

In each case, the total volume was minimised by 
microconcentration (Microcon-30; Amicon). These mixtures were 
denatured at 98°C for 3 minutes and annealed at TS'^C for 2 hours in the 
30 presence of 0.2M NaCl and 100|iM CTAB. 
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To each hybridisation reaction were added; 

1 0|al 1 0x Taq DNA polymerase buffer 

1 0fil 500u/|ii T4 endonuclease VI I 

80^1 dH20 
5 100^l 

The reactions were incubate at 37^C for 45 minutes, then 
inactivated at 70''C for 15 minutes. 

The DNAs were separated from iow molecular weight solutes 
by microconcentration (Microcon-30; Amicon) with successive additions of 
10 dH20 between episodes of centrifugation. In each case a volume of 

approximately 40|il was recovered which was diluted in a reaction mixture 
containing 5u/jj.i Exonuclease til: 

40^1 DNA 

15|iil lOx Exonuclease III buffer 
15 3.75)al 200u/!al Exonuclease III 

91^1 dHaO 
150^1 

The reactions were incubated at 37°C for 5 minutes, after 
which they were microconcentrated (Microcon-30; Amicon). The entire 

20 recovered volumes were subjected to electrophoresis on a 1,5% agarose 
gel stained with ethidium bromide. In addition, a molecular weight marker, 
400ng of the small allele without a 3* overhang, and 400ng of the smaller 
allele that possessed an overhang were loaded on to the get. 

The size of the smaller amplified allele was confirmed to be 

25 approximately 150bp by comparison to the molecular weight marker. After 
incubation with Terminal deoxynucieotidyi transferase the apparent size of 
this amplified allele had increased. A smear of products distributed over a 
range of sizes corresponding to between 400bp and 750bp of double 
stranded DNA was seen, though the majority of DNA was confined to an ill- 

30 defined band midway between these. In the lane containing hybridised 
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alleles of different sizes that had been digested, a band corresponding to 
approximately 300bp of double stranded DNA was seen against a back 
ground smear of products. This band was considered to be the result of 
enzymatic cleavage of the mis-match containing DNA duplexes, where as 
5 the back ground smear was considered to be single stranded DNA 

resulting from Exonuclease ill digestion of molecules lacking the protection 
of a 3' overhang. In the lane that contained hybridised alleles of the same 
size two ill-defined bands were visible against a background smear of 
products. The brightest band was of an appearance similar to that of the 

10 smaller allele following its incubation with Terminal deoxynucleotidyl 

transferase and was considered to represent the remaining single stranded 
DNA from heteroduplex molecules digested by Exonuclease 111. The fainter 
band was considered to the result of enzymatic cleavage of molecules 
possessing polymerase errors. As before, the background smear was 

15 considered to be due to single stranded DNA of molecules lacking a 3' 
overhang that had resulted from digestion by Exonuclease II!. This 
experiment suggests that an allele possessing a 3' overhang entering into a 
heteroduplex with an allele of a different repeat length is digested by T4 
endonuclease VII and Exonuclease III such that a fragment of the 

20 heteroduplex may be selected. 
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Appendix 



Consider a scenario that may typify a rare recessive trait. The affected 
group of individuals are homozygous for the same allele. In the wild type 
group, this allele has a relatively low frequency. 





starting scenario 




Affected 






Wild Type 






Alleles 


A 


B 


c 


D 


A 


B C 


D 


iO 


Allele frequencies 


1.0 


0.0 


0.0 


0.0 


0.15 


0.35 0.2 


0.3 




Allele ratios 


1 


0 


0 


0 


3 


7 4 


6 




After 1^* Round 




Affected 






Wild Type 






Alleles 


A 


B 


c 


D 


A 


B C 


D 


15 


Amount remaining 


1.000 


0,000 


0,000 


0.000 


0.023 


0.123 0.040 


0.090 




Total remaining 


1.0 








0.276 








Allele ratios 


1 


0 


0 


0 


23 


123 40 


90 




Allele frequencies 


rooo 


0.000 


0.000 


0.000 


0.083 


0.446 0.145 


0,326 


20 


After 2""* Round 




Affected 






Wild Type 






Alleles 


A 


B 


c 


D 


A 


B C 


D 




Amount remaining 


1.0 


0.0 


0.0 


0.0 


0.006 


0.199 0.021 


0.106 




Total remaining 


1,0 








0.332 








Allele ratios 


1 


0 


0 


0 


6 


199 21 


106 


25 


Allele frequencies 


10 


0.0 


0.0 


0.0 


0.018 


0.599 0.063 


0.319 




After 3 Round 




Affected 






Wild Type 






Alleles 


A 


B 


c 


D 


A 


B G 


D 




Amount remaining 


1.0 


0.0 


0.0 


0.0 


0.000 


0.359 0.004 


0.102 


30 


Total remaining 


1.0 








0.465 








Allele ratios 


1 


0 


0 


0 


0 


359 4 


102 




Allele frequencies 


1.0 


0.0 


0.0 


0.0 


0.000 


0.772 0.008 


0,219 
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After 4**" Round Affected Wild Type 

Alleles ABCD ABCD 

Amount remaining 1.0 0.0 0.0 0.0 0.000 0.596 0.000 0.010 

Total remaining 1.0 0.606 

5 Allele ratios 1 0 0 0 0 596 0 10 

Allele frequencies 1.0 0.0 0.0 0.0 0.000 0.983 0.000 0.017 

Comparison of tlie 1 x1 x 1 x 1= 1 0.276 x 0.332 x 0.465 x 0.606 = 0.026 
ratios of remaining 
10 alleles 38.5:1 

all of which is A none of which is A 



Therefore, even if an large excess of wild type DNA is 
hybridised to the affected DNA that survives the mis-match discrimination 
15 procedure it is extremely likely that the allele present in the affected group 
will be recovered. 

Consider another scenario in which one allele is present in 
the affected group of individuals at a frequency greater than that of the wild 
type group. 



20 



25 



Starting scenario Affected Wild Type 

Alleles ABCDE ABODE 

Allele frequencies 0.050 0.100 0.000 0.150 0.700 0.250 0.200 0.150 0.250 0.150 

Allele ratios 1203 14 54353 



After 1** Round Affected Wild Type 

Alleles ABCDE ABCDE 

Amount remaining 0.003 0.010 0.000 0.023 0.490 0.063 0.040 0.023 0.063 0.023 

Total remaining 0.526 0.212 

30 Allele ratios 3 10 0 23 490 63 40 23 63 23 

Allele frequencies 0.006 0.019 0.000 0.044 0.932 0.297 0.189 0.108 0.297 0.108 
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After 2"^ Round Affected wild Type 

Alleles ABCDE ABCDE 

Amount remaining 0.000 0.000 0.000 0.002 0.869 0.088 0.036 0.012 0.088 0.012 

Total remaining 0.871 0.236 

Allele ratios 0 0 0 2 869 22 9 3 22 3 

Allele frequencies 0.000 0.000 0.000 0.002 0.998 0.373 0.153 0.051 0.373 0.051 



Wild Type 



After 3^"* Round Affected 

Alleles ABCDE ABCDE 

10 Amount remaining 0.000 0.000 0.000 0.000 0.996 0.139 0.023 0.003 0.139 0.003 

Total remaining 0.996 0.307 

Allele ratios 0 0 0 0 1 139 23 3 139 3 

Allele frequencies 0.000 0.000 0.000 0.000 1.000 0.453 0.075 0.010 0.453 0.010 

15 After 4^'' Round Affected Wild Type 

Alleles ABCDE ABCDE 

Amount remaining 0.000 0.000 0.000 0.000 1.000 0.205 0.006 0.000 0,205 0.000 

Total remaining 1.0 0.416 

Allele ratios 0 0 0 0 1 205 6 0 205 0 

20 Allele frequencies 0.000 0.000 0.000 0.000 1000 0.493 0.014 0.000 0.493 0.000 

Comparison of the 0.526 x 0.871 x 0.996 x 1 = 0.4560.212 x 0.236 x 0.307 x 0.416 = 0.006 
ratios of remaining 

alleles 76 ; ^ 

25 a!! of which is E none of which is E 



Therefore, even if an large excess of wild type DNA is 
hybridised to the affected DNA that survives the mis-match discrimination 
procedure it is extremely likely that allele E present in the affected group 
30 will be recovered. 
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CLAIMS 

5 1 . A method of making a mixture of VNTR alleles and their 

flanking regions of the genomic DNA of one or more members of a species 
of interest, which method comprises the steps of: 
a) dividing genomic DNA of the species of interest into 

fragments, 

10 b) ligating to each end of each fragment an adaptor thereby 

forming a mixture of adaptor-terminated fragments in which each 3'-end is 
blocked to prevent enzymatic chain extension, 

c) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR primer to 

15 create a mixture of 5'-flanking VNTR amplimers, 

d) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3'-flanking VNTR amplimers, 

e) and using genomic DNA of the one or more members of the 
20 species of interest as template with the mixture of 5'-flanking VNTR 

amplimers and/or the mixture of 3 -flanking VNTR amplimers as primers to 
make the desired mixture of VNTR alleles and their flanking regions. 

2. The method of claim 1 , wherein step b) is performed by 
terminating each 3 -end of each fragment to prevent enzymatic chain 

25 extension, and ligating each 5'-end of each fragment to an adaptor, thereby 
forming a mixture of adaptor terminated fragments. 

3, The method of claim 1 or claim 2, wherein in step c) the 
VNTR repeat sequences are removed from the 5'- flanking VNTR 
amplimers, and in step d) the VNTR repeat sequences are removed from 

30 the 3 - flanking VNTR amplimers. 
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4. The method of any one of claims 1 to 3, wherein in step c) 
and/or d) the adaptor or primer used contains at least one 
phosphorothioate bond. 

5. The method of any one of claims 1 to 4, wherein step e) is 
5 performed using as primers, either successively or together, both the 

mixture of 5'- flanking VNTR amplimers and the mixture of 3 - flanking 
VNTR amplimers. 

6. The method of any one of claims 1 to 5, wherein there is used 
in step e) genomic DNA of one or more members of the species of interest 

10 which manifest a trait of interest, whereby the resulting mixture of VNTR 
alleles and their flanking sequences is representative of those which 
manifest the trait of interest. 

7. The method of claim 6 wherein in a step f) the strands of the 
mixture of VNTR alleles and their flanking regions are separated and then 

15 re-annealed and any mis-matches are separated and discarded. 

8. The method of claim 7, wherein step f) is repeated to recover 
a single VNTR allele and its flanking regions. 

9. The method of any one of claims 6 to 8, wherein at least one 
VNTR allele and its flanking sequences representative of those which 

20 manifest the trait of interest, is hybridised with a mixture of VNTR alleles 
and their flanking sequences representative of those which do not manifest 
the trait of interest, and at least one match and/or at least one mis-match is 
selected to provide at least one VNTR allele or fragment thereof which is 
characteristic of the trait of interest. 

25 10. The method of claim 9, wherein the at least one VNTR allele 

and its flanking sequences representative of those which manifest the trait 
of interest, is provided with 3 - overlapping ends. 
11. A portion of genomic DNA of one or more members of a 

species of interest said portion consisting essentially of a representative 

30 mixture of alleles of a chosen VNTR sequence and their flanking regions on 
both sides. 
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12. The portion as claimed in claim 11, wherein the mixture of 
alleles is representative of those which manifest a trait of interest. 

13. The portion as claimed in claim 11 or claim 12, wherein each 
member of the mixture has an adaptor at each of its 3'-end and its 5'-end. 

5 14:- A portion of genomic DNA of one or more members of a 

species of interest, said portion consisting essentially of a single VNTR 
allele and its flanking regions and an adaptor at each of its S'-end and its 
5'-end, said allele being characteristic of those which manifest a trait of 
interest. 

10 1 5. A portion of genomic DNA of a species of interest, said 

portion consisting essentially of a representative mixture of 3 -flanking 
regions of a chosen VNTR sequence, each member of the mixture carrying 
an adaptor at its 3'-end, and a representative mixture of 5'-flanking regions 
of a chosen VNTR sequence, each member of the mixture carrying an 

15 adaptor at its 5'-end. 

16. A method of treating nucleic acids which consist essentially of 
a mixture of polymorphic alleles, the mixture being representative of those 
which manifest a trait of interest, which method comprises separating and 
then re-annealing strands of the mixture, and separating and discarding 

20 any mis-matches. 

17. The method of claim 16, wherein the mixture of polymorphic 
alleles is a mixture of alleles of a chosen VNTR sequence and their flanking 
regions. 

18. The method of claim 17, wherein the method is repeated to 
25 recover a single VNTR allele and its flanking regions. 

1 9. The method of any one of claims 1 6 to 1 8, wherein at least 
one VNTR allele and its flanking sequence representative of those which 
manifest the trait of interest, is hybridised with a mixture of VNTR alleles 
and their flanking sequences representative of those which do not manifest 

30 the trait of interest, and at least one match and/or at least one mis-match is 
selected to provide at least one VNTR allele or fragment thereof which is 
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characteristic of ttie trait of interest. 

20. The method of claim 1 9, wherein the at least one VNTR allele 
and its flanking sequence representative of those which manifest the trait of 
interest, is provided with 3'-overlapping ends. 

21 . A method of making a mixture of amplimers which method 
comprises the steps of: 

a) dividing genomic DNA of one or more members of a species 
of interest into fragments, 

b) ligating to each end of each fragment an adaptor thereby 
forming a mixture of adaptor-terminated fragments in which each 3'-end is 
blocked to prevent enzymatic chain extension, and 

c) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR primer to 
create a mixture of 5'-flanking VNTR amplimers, and/or 

d) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3'-flanking VNTR amplimers. 

22. A method of identifying an allele which is linked to a trait of 
interest, which method comprises incubating together under hybridisation 
conditions: at least one molecule of nucleic acid containing a polymorphic 
allele and its flanking sequences representative of those which manifest the 
trait of interest; and a mixture of molecules of nucleic acid which contain 
polymorphic alleles and their flanking sequences representative of those 
which do not manifest the trait of interest; and selecting at least one match 
and/or at least one mis-match to provide at least one allele or fragment 
thereof which is linked to the trait of interest. 

23. The method of claim 22, wherein the alleles are VNTR alleles. 

24. The method of claim 22 or claim 23, wherein the at least one 
allele and its flanking sequences representative of those which manifest the 
trait of interest, is provided with 3'- overlapping ends. 
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25. Use of the portion of genomic DNA as claimed in claim 14 in 
a diagnostic assay. 

26. The method of any one of claims 1 to 10 or 16 to 20, wherein 
^the VNTR allele and its flanking regions, or the mixture of VNTR alleles and 
their flanking regions, is analysed by being applied under hybridisation 
conditions to an array of immobilised VNTR alleles and/or their flanking 
regions. 

27. A kit comprising protocols and reagents for performing the 
method of any one of claims 1 to 10, 16 to 24 or 26. 
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APPENDIX A 



1 . A method of making a mixture of VNTR alleles and their flanking regions 
of the genomic DNA of one or more members of a species of interest, which method comprises 
the steps of: 

a) dividing genomic DNA of the species of interest into fragments, 

b) ligating to each end of each fragment an adaptor thereby forming a mixture 
of adaptor-terminated fragments in which each 3 '-end is blocked to prevent enzymatic chain 
extension, 

c) using a portion of the mixture of adaptor-terminated fragments as templates 
with an adaptor primer and a VNTR primer to create a mixture of 5 '-flanking VNTR amplimers, 

d) using a portion of the mixture of adaptor-terminated fragments as templates 
with an adaptor primer and a VNTR antisense primer to create a mixture of 3 '-flanking VNTR 
amplimers, 

e) and using genomic DNA of the one of more members of the species of 
interest as template v^th the mixture of 5 -flanking VNTR amplimers and/or the mixture of 3'- 
flanking VNTR amplimers as primers to make the desired mixture of VNTR alleles and their 
flanking regions. 

2. The method of claim 1, wherein step b) is performed by terminating each 
3 '-end of each fragment to prevent enzymatic chain extension, and ligating each 5'-end of each 
fragment to an adaptor, thereby forming a mixture of adaptor terminated fragments. 

3 . The method of claim 1, wherein in step c) the VNTR repeat sequences are 
removed from the 5'-flanking VNTR amplimers, and in step d) the VNTR repeat sequences are 
removed from the 3 '-flanking VNTR amplimers. 

4. The method of claim 1, wherein in step c) and/or d) the adaptor or primer 
used contains at least one phosphorothioate bond. 

5. The method of claim 1, wherein step e) is performed using as primers, 
either successively or together, both the mixture of 5 '-flanking VNTR amplimers and the mixture 
of 3 '-flanking VNTR amplimers. 

6. The method of claim 1, wherein there is used in step e) genomic DNA of 
one or more members of the species of interest which manifest a trait of interest, whereby the 
resulting mixture of VNTR alleles and their flanking sequences is representative of those which 
manifest the trait of interest. 

7. The method of claim 6 wherein in a step f) the strands of the mixture of 
VNTR alleles and their flanking regions are separated and then re-annealed and any mis-matches 
are separated and discarded. 

8. The method of claim 7, wherein step f) is repeated to recover a single 
VNTR allele and its flanking regions. 

9. The method of claim 6, wherein at least one VNTR allele and its flanking 
sequences representative of those which manifest the trait of interest, is hybridised v^th a mixture 
of VNTR alleles and their flanking sequences representative of those which do not manifest the 



trait of interest, and at least one match and/or at least one mis-match is selected to provide at least 
one VNTR allele or fragment thereof which is characteristic of the trait of interest. 

10. The method of claim 9, wherein the at least one VNTR allele and its 
flanking sequences representative of those which manifest the trait of interest, is provided with 
3 '-overlapping ends. 

11. A portion of genomic DNA of one or more members of a species of 
interest, said portion consisting essentially of a representative mixture of alleles of a chosen 
VNTR sequence and their flanking regions on both sides. 

12. The portion as claimed in claim 11, wherein the mixture of alleles is 
representative of those which manifest a trait of interest, 

13. The portion as claimed in claim 1 1, wherein each member of the mixture 
has an adaptor at each of its 3 -end and its 5 -end. 

14. A portion of genomic DNA of one or more members of a species of 
interest, said portion consisting essentially of a single VNTR allele and its flanking regions and 
an adaptor at each of its 3*-end and its 5'-end, said allele being characteristic of those which 
manifest a trait of interest. 

15. A portion of genomic DNA of a species of interest, said portion consisting 
essentially of a representative mixture of 3 -flanking regions of a chosen VNTR sequence, each 
member of the mixture carrying an adaptor at its 3'-end, and a representative mixture of 5*- 
flanking regions of a chosen VNTR sequence, each member of the mixture carrying the same 
adaptor at its 5 -end. 

16. A method of treating nucleic acids which consist essentially of a mixture 
of polymorphic alleles, the mixture being representative of those which manifest a trait of interest, 
which method comprises separating and then re-annealing strands of the mixture, and separating 
and discarding any mis-matches. 

17. The method of claim 16, wherein the mixture of polymorphic alleles is a 
mixture of alleles of a chosen VNTR sequence and their flanking regions. 

18. The method of claim 17, wherein the method is repeated to recover a single 
VNTR allele and its flanking regions, 

19. The method of claim 16, wherein at least one VNTR allele and its flanking 
sequence representative of those which manifest the trait of interest, is hybridised with a mixture 
of VNTR alleles and their flanking sequences representative of those which do not manifest the 
trait of interest, and at least one match and/or at least one mis-match is selected to provide at least 
one VNTR allele or fragment thereof which is characteristic of the trait of interest. 

20. The method of claim 19, wherein the at least one VNTR allele and its 
flanking sequence representative of those which manifest the trait of interest, is provided v^th 3*- 
overlapping ends. 

21. A method of making a mixture of amplimers which method comprises the 

steps of: 
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a) dividing genomic DNA of one or more members of a species of interest 

into jfragments, 

b) ligating to each end of each fragment an adaptor thereby forming a mixture 
of adaptor-terminated fragments in which each 3 '-end is blocked to prevent enzymatic chain 
extension, and 

c) using a portion of the mixture of adaptor-terminated fragments as templates 
with an adaptor primer and a VNTR primer to create a mixture of 5'-flanking VNTR amplimers, 
and/or 

d) using a portion of the mixture of adaptor-terminated fragments as templates 
with an adaptor primer and a VNTR antisense primer to create a mixture of 3 -flanking VNTR 
amplimers. 

22. A method of identifying an allele which is linked to a trait of interest, which 
method comprises incubating together under hybridisation conditions: at least one molecule of 
nucleic acid containing a polymorphic allele and its flanking sequences representative of those 
which manifest the trait of interest; and a mixture of molecules of nucleic acid which contain 
polymorphic alleles and their flanking sequences representative of those which do not manifest 
the trait of interest; and selecting at least one match and/or at least one mis-match to provide at 
least one allele or fragment thereof which is linked to the trait of interest. 

23 . The method of claim 22, wherein the alleles are VNTR alleles. 

24. The method of claim 22, wherein the at least one allele and its flanking 
sequences representative of those which manifest the trait of interest, is provided with 3- 
overlapping ends. 

25. Use of the portion of genomic DNA as claimed in claim 14 in a diagnostic 

assay. 

26. The method of claim 1 or claim 16, wherein the VNTR allele and its 
flanking regions, or the mixture of VNTR alleles and their flanking regions, is analysed by being 
applied under hybridsation conditions to an array of immobilised VNTR alleles and/or their 
flanking regions. 

27. A kit comprising protocols and reagents for performing the method of 
claim 1 or claim 16 or claim 24. 
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